Ann Isabel SANZ MOLINERO 
Appl. No. 10/537,897 
Atty. Ref.: 4982-5 
Amendment 
November 18, 2008 

REMARKS 

Reconsideration is requested. 

The specification has been amended to include the attached Sequence Listing 
and sequence identifiers corresponding to same. The attached paper and computer 
readable copies of the Sequence Listing are the same. No new matter has been added. 

The Examiner interview of July 31 , 2008 is acknowledged, with appreciation. 
The Interview Summary mailed September 30, 2008 is accurate in the brief description 
of the issues discussed during the interview. 

Claims 1-4, 10-17, 19-23, 29-34 and 43-56 are pending. Claims 53-56 have 
been added. The new claims find support throughout the specification. No new matter 
has been added. Claims 53-56 are believed to read on the elected subject matter. 

The claims have been revised to read on the elected subject matter. The 
applicants note, for the Examiner's convenience, that SEQ ID NO:5 is the motif 
QALGGH, SEQ ID NO:7 is an EAR motif, SEQ ID NO:8 is a B-Box motif and SEQ ID 
NO:9 is an L-Box motif. SEQ ID NO:1 encodes a sequence containing the QALGGH 
motif, an EAR motif, a B-Box motif and an L-Box motif, as described, for example, in 
Figure 3 of the specification. 

The above amendments assume that the Amendment of March 31 , 2008 has 
been entered. The Examiner is requested to advise the undersigned in the event 
anything further is required for entry of the above claim amendments. 

The Examiner is requested to return a completely-initialed copy of the PTO-1449 
Form bearing the OIPE date-stamped of June 7, 2005. Specifically, the PTO-1449 
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Form returned with the Office Action of November 30, 2007, does not include the 
Examiner's initials next to foreign patent documents 96/39020 A, 1 230 843 A and 
01/90343 A. The entirety of the PTO-1449 Form has been initialed by the Examiner on 
November 19, 2007, and the other references indicated as having been specifically 
considered by initialing next to each of the references. The above-noted foreign 
documents however do not include in the left-hand column the Examiner's initials. A 
completely-initialed copy of the PTO-1449 Form, pursuant to MPEP § 609, is requested. 

The Examiner is requested to provide a complete PTO 892 Form, which includes 
the title of each cited Non-Patent Document so that the absence of same does not delay 
printing of the present application, or applications based on same, as a patent once 
patentable subject matter is identified and formal requirements are otherwise met. 
Moreover, lack of titles of each cited Non-Patent Document in an examiner's PTO-892 
can, in the undersigned's experience, cause difficulty in prosecution of subsequent 
related applications where a different examiner may believe a title is required for a 
reference which is not required to be submitted in the subsequent application and may 
be difficult to locate. The Examiner is again requested to see MPEP § 707.05(e) in this 
regard. 

The specification has been revised in the Amendment of March 31, 2008 to 
delete the browser executable code. Withdrawal of the objection to the specification is 
requested. 

The objection to claim 4 is obviated by the above amendments filed March 31 , 
2008. Withdrawal of the objection is requested. 
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The Section 112, second paragraph, rejection of claims 10-17 is obviated by the 
amendments filed March 31, 2008. Withdrawal of the Section 112, second paragraph, 
rejection is requested. 

To the extent not obviated by the above amendments, the Section 112, first 
paragraph "written description", rejection of claims 1-4, 10-17, 19-23 and 29-34 is 
traversed. Reconsideration and withdrawal of the rejection are requested in view of the 
above and the following comments. 

The applicants provide, for example, on page 14 of the application the essential 
regions set forth in SEQ ID NOs: 5, 7, 8 and 9 identified in SEQ ID NO: 2, which is the 
protein encoded by SEQ ID NO: 1 . 1 One of ordinary skill in the art will appreciate from 
the present specification that it is not necessary to include all the regions present in 
order to perform the methods of the claimed invention. 

The Examiner asserts that the specification allegedly fails to adequately describe 
polynucleotide sequences that encode any 2XC2H2 zinc finger protein or a portion of 
SEQ ID NO: 1 having the same activity as SEQ ID NO: 1. Page 6, lines 14-23 of the 
application describes variants of 2XC2H2 nucleic acids and their encoded proteins 
useful in performing the methods of the claimed invention and therefore having a similar 
activity to SEQ ID NO: 1 . Further the variants are described in detail on pages 6 to 16 of 
the present application. Page 7 details variant 2xC2H2 zinc finger nucleic acid having 
similar activity to SEQ ID NO: 1 . Moreover, examples of SEQ ID NO: 2 homologous and 

1 "Particular zinc finger homologues useful in the methods of the present invention have one or 
more of the conserved motifs as depicted in SEQ ID NO 5, 6, 7, 8 and 9, or motifs that are 80% identical 
to these motifs or motifs that have conserved substitutions of amino acids. The 2xC2H2 protein as set 
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orthologous proteins, and the nucleic acids encoding the same are given in SEQ ID NO: 
12 to SEQ ID NO: 25 which are further detailed in page 14 line 32-37 and page 15 lines 
1-2. Examples of homologues of SEQ ID NO: 2 and the encoding polynucleotides 
originating from the same species are given in SEQ ID NO: 26 to SEQ ID NO: 35 and 
further detailed in page 15 line 6-17. Furthermore examples of alternative cDNAs 
(polynucleotides) encoding SEQ ID NO: 2 and therefore having the same activity of 
SEQ ID NO: 1, which also encodes SEQ ID NO: 2, are given in page 15 lines 23-24. 
The last paragraph of page 15 provides a number of 2xC2H2 homologous proteins 
useful in the methods of the invention. Identifying the polynucleotides encoding such 
homologous proteins is in the realm of the skill in the art. 

In referring to Decisions of the Federal Circuit Court of Appeals, the Examiner 
states that a written description of an invention "requires a precise definition, one that 
defines structural features of the chemical genus that distinguishes it from other 
chemical structures". See page 4 of the Office Action dated November 30, 2007. The 
Examiner further states that the description of a genus of cDNAs may be achieved by 
reciting a representative number of nucleotide sequences falling in the scope of the 
genus. See paragraph spanning pages 4-5 of the Office Action of November 30, 2007. 

The applicants submit that the present specification provides the information 
which the Examiner asserts to be required. 

Page 15 of the application, for example, provides several cDNAs encoding the 
same protein as that of SEQ ID NO 2. Numerous examples of 2XC2H2 encoding 

forth in SEQ ID NO 2 comprises all the boxes as set forth in SEQ ID NO 5, 7, 8 and 9. All its paralogues 
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nucleotide sequences are described in the application, for example on page 15, five 
distinct examples of 2XC2H2 nucleotide sequences encoded in the genome of 
Arabidopsis thaliana are described. An exhaustive list is not believed to be required in 
the specification. The applicants believe however that the specification provides a 
sufficient representative number of examples that when taken together with those 
provided on page 15 and on Table 9 of the application, and the sequence listing, the 
figures, and the guidance provided on how to find other candidate sequences provides 
a precise definition of the 2XC2H2 genus, such that one of ordinary skill in the art will 
appreciate that the applicants were in possession of the claimed invention at the time 
the application was filed. 

Beyond the above, the applicants note that page 14 of the application, for 
example, describes the structural features of SEQ ID NO: 2 as set forth in SEQ ID NO 
5, 7, 8 and 9. Detailed characterization of each of these features and ways to identify 
them is provided in pages 11 to 13 of the application. Furthermore a substantial number 
of examples of 2XC2H2 nucleic acids and 2XC2H2 proteins comprising the structural 
features are cited in the patent application and sequence listing. 

Further, attached as Annex 3 is a sequence alignment of multiple sequences of 
2XC2H2 proteins as found in the sequence listing wherein positional motifs as 
represented by SEQ ID NOs: 5, 7, 8 and 9 are indicated by a box over the consensus 
sequence. Moreover, the attached Annex 4 provides a an alignment of multiple 
2XC2H2 proteins as described in the specification in the paragraph bridging pages 15- 

and orthologues also comprise all of these boxes." 
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16 with reference to accession numbers known in the art. The position of the motifs as 
represented by SEQ ID NOs: 5, 7, 8 and 9 are indicated by a box over the consensus 
sequence. Annex 6 attached hereto provides a calculation for sequences of the 
disclosure containing the motifs related to SEQ ID NOs: 7, 8 and 9 of the specification 
and claims. 

In summary, the applicants submit that there is ample written description 
provided in the present application providing ample structural information and a wealth 
of examples of polynucleotide sequences encoding 2XC2H2 proteins. Altogether, the 
structural features provided in the specification, the sequence in the sequence listing, 
and the guidance provided in the specification will lead one of ordinary skill in the art to 
appreciate that the applicants were in possession of the claimed invention at the time 
the application was filed 

Withdrawal of the Section 112, first paragraph "written description", rejection is 
requested. 

To the extent not obviated by the above amendments, the Section 112, first 
paragraph "enablement", rejection of claims 1-4, 10-17, 19-23 and 29-34 is traversed. 
Reconsideration and withdrawal of the rejection are requested in view of the above and 
the following comments. 

The applicants submit that one of ordinary skill in the art will be able to make and 
use the claimed invention without undue experimentation. The specification provides an 
enabling disclosure of the claimed invention. 
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The Examiner is requested to see the attached Annex 1 in this regard which 
provides further experimental data showing that polynucleotide sequences encoding 
proteins having less than 100% identity to SEQ ID NO: 2, such as SEQ ID NO: 26 and 
SEQ ID NO: 36, when transformed into a plant give plants having increased yield, 
increased leaf surface and prolonged vegetative growth. The identity between SEQ ID 
NO: 2 and SEQ ID NO: 27 (which is encoded by the nucleic acid in SEQ ID NO: 26) is 
42.2%. The identity between SEQ ID NO: 2 and SEQ ID NO: 37 (which is encoded by 
the nucleic acid in SEQ ID NO: 36) is 31 % (Annex2). 

Despite the relatively low overall sequence identity, the proteins encoded by SEQ 
ID NO: 26 and SEQ ID NO: 36 comprised one or more of the regions identified in the 
claims. 

The applicant has transformed rice plants with two different nucleic acids, one 
originating from Arabidopsis and a second one from rice, both encoding a 2XC2H2 
protein. The sequences are described in the present application as SEQ ID NO: 26 and 
SEQ ID NO: 36, respectively. All experiments were carried out essentially as described 
in the Examples section of the application. The transgenic rice plants expressing the 
Arabidopsis 2XC2H2 transgene AtSTZparalogl (SEQ ID NO: 26) showed increases in 
the parameters described in the Table I and Table II of Annex 1 . 

The transgenic rice plants expressing the rice 2XC2H2 transgene showed 
increases in the parameters described in the Table III of Annex 1 . 

Annex 5 attached hereto further describes transformation of rice plants with a 
nucleic acid as described in SEQ ID NO:28 of the present specification and which 
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encodes the polypeptide described in SEQ ID NO:29. The results of Annex 5 
demonstrate an increased yield. 

The Examiner is understood to be of the opinion that the state-of-of-the art- is 
such that one of skill in the art cannot predict which nucleic acid or portions thereof 
capable of hybridizing to SEQ ID NO:1 will encode a protein with the same activity as 
the protein encoded by SEQ ID NO: 1 . The applicant respectfully disagrees and the 
Examiner is requested to see, for example, pages 13-16 of the present application 
which provides ample teaching of the amino acid sequence and structural features 
comprised within proteins encoded by a nucleic acid having similar activity to SEQ ID 
NO: 1 and therefore useful in the methods of the claimed invention. 

Methods for the identification of 2xC2H2 zinc finger proteins having similar amino 
acid sequence and structural features to SEQ ID NO: 2 are well known in the art and 
provided in the present application. The Examiner is requested to see, for example, 
pages page 10 and page 1 1 first and second paragraphs in this regard. Furthermore, 
nucleic acid or portions thereof capable of hybridizing to SEQ ID NO:1 and encoding a 
protein with the same activity as the protein encoded by SEQ ID NO: 1 are illustrated 
by numerous examples in Page 7, second paragraph. 

The Examiner is further understood to be of the opinion that methods for the 
prediction of protein structure from sequence data in order to ascertain functional 
aspects of the protein are extremely complex and that the prediction possibilities of the 
positions within the protein's sequence where amino acid substitutions can be made 
with a reasonable expectation of maintaining function are limited. Though there may be 
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examples in the field that can support such statements, the attention of the Examiner is 
brought to the fact that such general statements do not apply to the presently claimed 
invention. 

Specifically, the methods for the prediction of protein structural features referred 
to in pages 10 and 1 1 of the application will not require undue experimentation and are 
straight forward to use. Such methods are routinely used in a successful manner by 
those in the art without the need for any non-routine work or undue burden. 
Performance of such methods is in the realm of those skilled in the art. 

The applicants further understand the Examiner to be of the opinion that the 
application fails to provide guidance for selecting a sequence that gives results when 
transformed into a plant. 

The applicant respectfully submits however that detailed guidance for searching 
and identifying sequences encompassed within the claims is given in the specification 
and the Example section illustrates/demonstrates the claimed invention in detail. As 
shown in attached Annex 1 , the applicant has used the method disclosed in the 
application for finding homologous sequences to SEQ ID NO: 1 encoding 2XC2H2 
proteins in Arabidopsis and rice genomes. Further the applicant has succeeded to 
isolate such homologous sequences (SEQ ID NO: 26 and SEQ ID NO: 36) and show 
that when introduced in a transgenic plant, the yield and the leaf surface are increased 
and the vegetative growth phase is prolonged. 
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Examples in Annex 1 further illustrate how the guidance provided in the present 
application is sufficient to enable the ordinarily skilled artisan to identify sequences 
useful in the methods of the claimed invention. 

The Examiner is further understood to be of the opinion that transforming plants 
with heterologous genes involved in plant development produce unpredictable results. 
Though there maybe examples in the field that illustrate such a statement, this is not the 
rule. 

A basic axiom in the genetics field is that which associates a phenotype to a 
genotype. The assumption that by transforming plants with a heterologous genes gives 
reproducible results is a basic principle widely used in the transgenic research field for 
example to prove the effect of a gene. 

There are numerous examples to illustrate that transformation of a heterologous 
gene in a plant results in reproducible effects. For example the Arabidopsis gene 
AtNHXI was reported to give salt-tolerance when transformed in Arabidopsis, such 
effect was reproduced when transformed in tomato and in Brassica napus. Further the 
transformation of homologous genes to AtNHXI originating from Atriplex gmelini in rice 
plants resulted in the predicted effect of improving salt tolerance. 

Another example is that of the Arabidopsis thaliana CBF1 transcription factor, 
which when transformed into Arabidopsis thaliana, Brassica napus and tomato plants 
consistently resulted in abiotic stress tolerance; and produced the same effect, that is, 
increased stress tolerance. 
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In summary, the specification as filed provides sufficient guidance to identify, 
isolate and use 2XC2H2 nucleic acids useful in the methods of the claimed invention, 
without undue experimentation. Furthermore the data in Annex 1 provide evidence that 
the predicted result of increasing expression of such nucleic acids in a plant is enabled 
over the scope of the claims. 

On page 10 of the Office Action dated November 30, 2007, the Examiner argues 
lack of disclosure of methods to isolate sequences encompassed by the claims. The 
Examiner is requested to see however the present specification where methods to 
search and identify the sequences encompassed by the claims are described, as 
detailed above. Several alternative methods are set forth. Several examples are 
provided. Alternative methods to isolate nucleic acids of interest are well known in the 
art (Sambrook et al 2001 , page 8, line 6 of the Application). Further and as shown in 
Annex 1, the applicant has used the method disclosed in the application for identifying 
homologous sequences to SEQ ID NO: 1 encoding 2XC2H2 proteins in Arabidopsis and 
rice genomes. Furthermore the applicant has isolated such homologous sequences 
(SEQ ID NO: 26 and SEQ ID NO: 36) and shown that when introduced in a transgenic 
plant, the yield and the leaf surface are increased and the vegetative growth phase is 
prolonged. Examples in Annex 1 further illustrate how the guidance provided in the 
application is sufficient to enable one of ordinary skill in the art to identify sequences 
useful in the methods of the claimed invention. Altogether, the teaching and guidance in 
the specification and the illustration with the various examples fully enables the 
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ordinarily skilled artisan to isolate and identify the sequences encompassed in the 
claims. 

Withdrawal of the Section 112, first paragraph "enablement", rejection is 
requested. 

The Section 102 rejection of claims 1-4, 10-17, 19-23 and 29-34 over Pineda 
(WO 01/36598 A1), is obviated by the above amendments. Reconsideration and 
withdrawal of the rejection are requested as the cited document fails to teach or 
suggest, for example, selecting for plants having increased yield, as required by the 
claims. 

The claims are submitted to be in condition for allowance and a Notice to that 
effect is requested. 

The Examiner is requested to contact the undersigned, preferably by telephone, 
in the event anything further is required in this regard. 



Respectfully submitted, 
NIXON & VANDERHYE P.C. 



By: IB. J. Sadoff/ 

B. J. Sadoff 
Reg. No. 36,663 

BJS: 

901 North Glebe Road, 11th Floor 
Arlington, VA 22203-1808 
Telephone: (703)816-4000 
Facsimile: (703)816-4100 
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Annex 1 
Example A 

AtSTZparalogl: SEQ ID NO: 26 under the control of constitutive promoter GOS2 

A DNA fragment encoding a 2XC2H2 protein represented in the application as filed by SEQ ID 
NO: 27 was isolated from an Arabidopsis thaliana seedling cDNA library (Invitrogen, Paisley, 
UK) by PCR amplification and subsequent cloning in an entry clone vector according to the 
methods described in Example 1 of the present application. SEQ ID NO: 27 polypeptide was 
encoded by the longest open reading frame of SEQ ID NO: 26 (AtSTZparalogl). 

The primers used for the PCR amplification were as follows: 

Forward primer: Ggggacaagtttgtacaaaaaagcaggcttaaacaatggccctcgaagcg 
Reverse primer: Ggggaccactttgtacaagaaagctgggttcgagtattagatttttaaagataaatc 

The entry clone was subsequently used in an LR reaction with a destination vector used for rice 
transformation to generate the plant expression vector pGOS2::AtSTZparalog1. The 
constitutive promoter, GOS2, was mentioned in Table 10 on page 48 of the application as filed 
as being a promoter useful in the methods of the invention. 

Phenotypic characterization of the transformed plants was carried out essentially as described 
in Example 3 of the present application. The results are shown in Table I below. 

Table I: Results of phenotypic characterization of T2 rice plants transformed with 
pGOS2::AtSTZparalog1. 



pGOS2::AtSTZparalog1 


Parameter 


% increase in the transgenic 
plants compared to the 
nullyzygous plants 


Aboveground 
area 


10 


Root Area 


4 
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pGOS2::AtSTZparalog1 


Parameter 


% increase in the transgenic 
plants compared to the 

ni ill\/7\/nni iq nlpntQ 
i iuiiyz.yyuuo jjicii no 


Total Seed 
Weight 


48 


Number of 
filled seeds 


46 


Total number 
of seeds 


12 


Seed filing 
rate 


30 


Flowers per 
panicle 


8 


Harvest 
index 


35 


Days to 
flowering 


4 



The above results show that overexpression of the nucleic acid represented by SEQ ID NO: 27 
(encoding the 2XC2H2 zinc finger protein represented by SEQ ID NO 26) under the control of a 
constitutive promoter (GOS2) gives: 

- Increased plant yield (in the form of increased root area, total seed weight, total number 
of seeds, number of filled seeds, seed filling rate, flowers per panicle and harvest index); 
Increased leaf surface area (as manifested by increased aboveground area); and 
Prolonged vegetative growth (where the time to flower was on average 4% longer in 
transgenic plants compared to corresponding nullyzygous plants). 



Example B 

AtSTZparalogl: SEQ ID NO: 26 under the control of seed-specific promoter 
prolamin 

A DNA fragment encoding a 2XC2H2 protein represented by SEQ ID NO: 27 was isolated from 
an Arabidopsis thaliana seedling cDNA library (Invitrogen, Paisley, UK) by PCR amplification 
and subsequent cloning in an entry clone vector according to the methods described in Example 
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1 of the present application. SEQ ID NO: 27 polypeptide was encoded by the longest open 
reading frame of SEQ ID NO: 26 (AtSTZparalogl). 

The primers used for the PCR amplification were as follows: 

Forward primer: Ggggacaagtttgtacaaaaaagcaggcttaaacaatggccctcgaagcg 
Reverse primer: Ggggaccactttgtacaagaaagctgggttcgagtattagatttttaaagataaatc 

The entry clone was subsequently used in an LR reaction with a destination vectors used for 
rice transformation to generate the plant expression vector pPROLAMIN::AtSTZparalog1 . The 
seed-specific promoter, prolamin, was mentioned in Table 10 on page 48 of the application as 
filed as being a promoter useful in the methods of the invention. 

Phenotypic characterization of the transformed plants was carried out essentially as described 
in Example 3 of the present application. The results are shown in Table II below. 

Table II: Results of phenotypic characterization of T2 rice plants transformed with 
pPROLAMIN::AtSTZparalog1. 



pPROLAMIN::AtSTZparalog1 


Parameter 


% increase in the transgenic 
plants compared to the 
nullyzygous plants 


Aboveground 
area 


4 


Total Seed 
Weight 


8 


Number of 
filled seeds 


7 


Total number 
of seeds 


5 


Flowers per 
panicle 


4 


Harvest 
index 


5 
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The above results show that overexpression of the nucleic acid represented by SEQ ID NO: 27 
(encoding the 2XC2H2 zinc finger protein represented by SEQ ID NO 26) under the control of a 
seed-specific promoter (prolamin) gives: 

- Increased plant yield (in the form of increased aboveground area, total seed weight, total 
number of seeds, number of filled seeds, flowers per panicle and harvest index); 

- Increased leaf surface area (as manifested by increased aboveground area); and 

Example C 

OsSTZ(ortholoq): SEQ ID NO: 36 under the control of root-specific promoter RCc3 

A DNA fragment comprising the coding region of SEQ ID NO 36 was PCR amplified and cloned 
using methods essentially as described in the Examples section of the present application. 

The primers used for the PCR amplification were as follows: 

Forward primer: ggggacaagtttgtacaaaaaagcaggcttaaacaatgtcgagcgcgtcgt 

Reverse primer: ggggaccactttgtacaagaaagctgggtctgaattacgcggtgagaag 

A plant transformation vector carrying the coding region of SEQ ID NO 36 under the control of 
the root specific promoter, RCc3, was made, giving construct CD10315 described in Table 9 on 
page 47 of the present application. Agrobaterium-mediated transformation of rice plants was 
carried out to generate transgenic rice plants carrying the construct CD 1031 5. 

The results of the phenotypic evaluation of the CD10315-transgenic plants of the T2 generation 
are shown in Table III below. 

Table III: Results of phenotypic characterization of T2 rice plants transformed with 
pRCc3::OsSTZortholog. 
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CD10315 plants 


Parameter 


% increase in the transgenic 
plants compared to the 
nullyzygous plants 


Aboveground 
area 


6 


Root Area 


5 


Total Seed 
Weight 


29 


Number of 
filled seeds 


29 


Seed filling 
rate 


16 


Total number 
of seeds 


17 


Harvest 
index 


25 



The above results show that overexpression of a nucleic acid encoding the 2XC2H2 zinc finger 
protein represented by SEQ ID NO 36) under the control of a root-specific promoter (RCc3) 
gives: 

Increased plant yield (in the form of increased aboveground area, root area, total seed 
weight, total number of seeds, number of filled seeds, seed filling rate and harvest 
index); and 

- Increased leaf surface area (as manifested by increased aboveground area). 
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Annex 2 

Sequence identity between SEQ ID NO: 2 and SEQ ID NO: 27 and SEQ ID NO: 37. 

Amino acid sequence identity between SEQ ID NO: 2 and SEQ ID NO: 27 (encoded protein in 
SEQ ID NO: 26) and SEQ ID NO: 37 (encoded protein in SEQ ID NO: 36) was determined using 
the Needleman and Wunsch as described in page 10 paragraph 2 of the Application. 

Results for the comparison between SEQ ID NO: 2 and SEQ ID NO: 27 are given in Table 1 
and Figure 1. 

Results for the comparison between SEQ ID NO: 2 and SEQ ID NO: 37 are given in Table 2 
and Figure 2. 

Sequence identity between SEQ ID NO: 2 and its homologous protein SEQ ID NO: 27 is 42.2 
%. Highly conserved domains corresponding to the SEQ ID NO 5 (motif within C2H2 zinc finger 
domain), 7 (EAR motif), 8 (B-box ) and 9 (L-box) are readily identifiable in both SEQ ID NO: 
27 and 37. 



Table 1. Similarity and identity between SEQ ID NO: 2 and SEQ ID NO: 27. 



Algorithm 


Query 1 


Query 1 


% 

identity 


% gaps 


% 

similarity 


Needleman and Wunsch 


SEQIDN027 


SEQIDN02 


42.2 


22 


53.5 


SEQIDN027 1 MALEAMNTPTSSFTRIETKEDLMNDAVF 
1 1 1 1 1 : • : 1 • • : II • : 1 1 


--IEPWLKRKRSKRQRSHSPSS 4 8 
: 1 • 1 • 1 • 1 1 1 1 1 • 1 1 



SEQIDN02 1 MALE AL T S PRLAS P I P PL FE D SSVFHGVEHWTKGKRSKRSRS 42 

SEQIDN02 7 4 9 SSSSPPRSRPKSQNQDLTEEEYLALCLLMLAKD--QPSQTRFHQQSQSLT 96 



SEQIDN02 43 DFHHQNLTEEEYLAFCLMLLARDNRQP P 7 0 

SEQIDN02 7 97 PPPESKNLPYKCNVCEKAFPSYQALGGHKASHRIKPPTVISTTADD S 143 

I I I . . : . I . I I I : I I : I . I . I I I I I I I I I I I I I : | . . . | | | 

SEQIDN02 71 PPPAVEKLSYKCSVCDKTFSSYQALGGHKASHRKNLSQTLSGGGDDHSTS 12 0 

SEQIDN02 7 144 TAPTISIVAGEKHPIAASGKIHECSICHKVFPTGQALGGHKRCHYEGNLG 193 
: I • I • I • I • • • I I I • I • I : I I : I . I I : I I I I I I I I I I I I I I I 
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SEQIDN02 121 SATTTSAVT TGSGKSHVCTICNKSFPSGQALGGHKRCHYEGN-- 162 

SEQIDN02 7 194 GGGGGGSKSISHSGSVSSTVSEERSHRGFIDLNLPALPELSLHHNPIVDE 243 

: . I : I : I II I I I I I I I I : I • : I I . I : . : . I : 

SEQIDN02 163 --NNINTSSVSNSEGAGSTSHVSSSHRGF-DLNIPPIPEFSMVNG DD 206 

SEQIDN02 7 244 EILSPLTGKKPLLLTDHDQVIKKEDLSLKI-- 273 

I : : I I : • • I I I : . | . . : | : 

SEQI DN02 2 0 7 EVMSPMPAKKP RFDFPVKLQL 22 7 

Figure 1. Alignment between SEQ ID NO: 2 and SEQ ID NO: 27. 



Table 2. Similarity and identity between SEQ ID NO: 2 and SEQ ID NO: 27. 



Algorithm 


Query 1 


Query 1 


% 

identity 


% gaps 


% 

similarity 


Needleman and Wunsch 


SEQIDN02 7 


SEQIDN02 


31 


31 


42.2 



SEQIDN037 


1 


MSSASSMEALHAAVLKEEQQQHEVEEATWTSSSATSGEEGGHLP 

1 1 : 1 

MALE AL T S PRLAS P I P PL FE D 


45 


SEQIDN02 


1 


21 


SEQIDN037 


46 


QGWAKRKRSRRQRS EEENLALCLLMLARGGHHRVQ 

: • 1 • 1 • 1 1 1 : 1 • 1 1 1 1 1 • 1 1 • 1 1 : : 1 1 1 • • • 1 

SSVFHGVEHWTKGKRSKRSRSDFHHQNLTEEEYLAFCLMLLARDNR Q 


80 


SEQIDN02 


22 


68 


SEQIDN037 
SEQIDN02 


81 
69 


APPPLSASAPPPAGAEFKCSVCGKSFSSYQALGGHKTSHRVKLPTPPAAP 

•III 1 : | | | | | . | : | | | | | | | | | | | .|||.. | . 

PPPP PAVEKLSYKCSVCDKTFSSYQALGGHKASHRKNLS 


130 
107 


SEQIDN037 


131 


VLAPAPVAALLPSAEDRE PATSSTAASSDGMTNRVHRCSICQKEFPTGQA 
. . | : | . . . : : : : | . : : : . : . | . | : | | . | . | | : | | | 

QTLSGGGDDHSTSSATTTSAVTTGSGKSHVCTICNKSFPSGQA 


180 


SEQIDN02 


108 


150 


SEQIDN037 


181 


LGGHKRKHYDGGVGAGAGASSTELLATVAAESEVGSSGNGQSATRAFDLN 
111111-11:1 : : 1 : | | : . : . . | : . | . | | | | 

LGGHKRCHYEGNNNINT SSVSNSEGAGSTSHVSSSHRGFDLN 


230 


SEQIDN02 


151 


192 


SEQIDN037 


231 


LPAVPEFVWRPCSKGKKMWDEEEEVQSPLAFKKPRLLTA 2 69 

: 1 . : 1 1 1 . | . : . : : | | . | | : . . | | | | 

IPPIPEF SMVNGDDEVMSPMPAKKPRFDFPVKLQL 22 7 




SEQIDN02 


193 





Figure 2. Alignment between SEQ ID NO: 2 and SEQ ID NO: 37. 
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Annex 3 



Multiple amino acid sequence alignment of 2XC2H2 proteins as found in the in the amino acid 
of the sequence listing of the application as filed. The position of the motifs as represented by 
SEQ ID NO, 5, 7, 8 and 9 are indicated by a box over the consensus sequence. 



SEQIBNO_l 1 JDat i sca._gi omerata ( 1 ) 

SEQTDNO_13_Glycine_max {1} 

SEQIDNO_15__Medicago__sativa (1) 

SEQI DNO_17_Ni cot iana_tabacum (!) 

SEQIDNO_19_Oryza_sat iva {1} 

SEQIDNO._37__pryza._ss at iva {1} 

SBQIDNO_23_Triticum_aeBtivum (1) 

SEQlDNO_2 7__Arabidops ia_thaliana ( 1 } 

SEQIDNG_3l_Arabidopsis_t ha liana (1} 

SEQIDNO_35_Arabidopsis_t ha liana (1) 

Seqidno__3 9___Arabidopsis _tiialiana ( 1 ) 

SEQlDNO__50_Arabidopais_thaliana ( 1 } 

SEQTDNO_46_Arabidopsis_thaliana ( 1 } 

SEQIDNO_2 l_Petunia_hybr ida ( 1 ) 

3EQIDNG_25_Capsicum_arniuurn (1} 

SEQlDNO_2 9___Arabidopsis_t ha liana ( 1 ) 

SEQIDNO_33_Arabidopsis m thaliana ( 1 ) 

Consensus (1) 



L so 

VJ\ VEAIM S ? T T - - -ATPVF H YDDP-S LN Y§E P§T| 

r*TTTSAENS?Ti TAPSF P FDD FT I Pfe£ 

WS0SS^^ T - ■■ ~ - ATP FT P- - FEEP^-LSYLETP^Tg 

K r|$&e£K&^A ATPTL PPRY EDDD - E IKNfD SmS> 

MSSAS^EAIr! "A.'." : T.KEEQQQKEVEEATWTSS£ATSQEEGGHLPQG:WAl 
MSSAS||^^pi§AVLKEEQQQHEVEEAT WTS S :PP T S GEE-- OH " PQTWAK 

- MS S S AMEALi -AT I P EQHQLDV EAAAA VS GAT S G EE S G 3 • V§Q G&A& 

KATEAMNTPTS SFTRI ETKEDLMN ■• DAVF§E PWLK 

,MS |iS TTTTTAP PPFLRCItDETE P ENjE SWT?! 

|T^gSf A TTTAR PLLRYRS EME P EMgEQ^AK 

-llQiElGiDQTQI I KGKRTKRQRSS - ~ STFWTAATTVTSTSSSAG 



MALEAiNSPT 

---iVTAl.EALN^PTG- 
MALE :E!SPTL 
P^I^AI^SPPP 

MALEALNSPT 



TTTPP SFQFEN5STG- -LKYlES^TK 

TPTPP PPQPESDGQQLRY§EK^h| 

S3PMP T LFQDSALGgHG 

VEDP LRFNGf EQ:W1 K 

L W K" 



SEQIDNO__Xl_Datisca_glonierata (3 2 ) 

SEQ I DNO_ 1 3 J3 1 y c i r^ max (28) 

SEQlDNO_15_Medicago_sativa (3 3) 

SEQIDNO__17_Nicotiana_tabacum (3 5) 

SEQIDftO_l 9 jDry z a_s at iva (51) 

3EQIDMO_3 7_Orysa_sativa (51) 

SEQIDNO_23_JTriticum_aestivum {46 ) 

SEQXDNO_2 7_Arabidopsis_thaliana (3 5 ) 

SEQI DNO_3 l_Ar ab idop s i s_tha I i ana (39) 

SEQIDMO_35_Arabidopsis_tHaliana (36) 

SEQIDNO_3 9_Arabidopsis_thal iana { 1 ) 

SEQIDNO_5 0_Arabidopsis_th.aIiana (4 5 ) 

SEQIDNO_46_Arabidops i s_thaliana { 1 ) 

SEQIDNO_21_Petunia__hybrida (33) 

SEQXDNO_25_Capsicum_annuum (36 ) 

SEQIDNO_2 9 _Arabidop s i s_thal i ana (29) 

SEQI DMO_ 3 3 _Arab 1 dop s i s_t hal i ana (27) 

Consensus {51) 



51 



RMKPTRTtSP^- 
)HPS-- 

CRPRI&AP-" 

IRC""" 



100 

■ --ElTsSFi'LArC JMT-ARGR 
HEEYLATC^1M__AHTG 
- - - CfEEE YTALC -.TMT ARSG 

.gE^EY^ATG.TTr ARSG 

• ■ EEETLAI.C-^.ARGG 



m m&<m — mmmm-m- mm 

RjK RGK: i-R3HSPSSSSSSPPRSRPKSQNQDLT^3FTLATC^^r,A|DQ 

r| - - r;tk.- m i |q pnp p i s e;£e'yla:uc : ;m : arg s 

R(£- »^gRQ||FiHGHQNQ ETNKNLPjjgBgYLALC _,LM_ ARG 1 

MKTTRSBYEES - M K.. TP X v E .' \ ' -S5i S F 

GS- -GGERAV|i'EYNSAVSSPVTTDCTQ- - EEi!Dj|g§dLlM- ARGT 

MVARSE E - - VEI vigTAil^r^I^Vl 

Gt- - |g§RQ|||MERQ C§EEE Vi-ALC JlHTARSD 

GK R^tesfeiMHHQ- FTEEEYLAEC .XM.. ARJ.G 

SKGKRSK- ::HSBFDR - - - - QGLTEDFYIATCPMi;P ^RPG 

C|- Kj^jgRS^^ KHHN HRl||eETLA.- C -» PA FJTG 

[K RSKR Rsj p T E jEE YLAL CI 7l MljAR G 



SEQ ID NO 8 



SEQ ID NO 9 



SEQlDNO_ll_Dati s ca__gl omerata ( G2 

SEQIDNO__13_GIyc ine_max ( 5 6 

SEQiDNO_i5_Medicago_sativa (64 

S EQIDNG__1 7_Nicot iana_tabacum ( 6 5 

SEQIDWO_19_Oryza__sativa (76 

SSQIDNO_3 7__Oryza_sativa (76 

SEQIDNO_23_Triticum_aestivum {71 

SEQlDNO_2 7__Arabidopsis_thai iana (83 

SEQIDNO_ 3 l__Arabidops i s__t ha 1 iana { 7 2 

SEQIDNO_3 5_Arabidopsis_ttial iana (76 

SSQIDNO_3 9_Arabidopsis_thaliana ( 2 9 

SEQIDNO_5 0_Arabidopsis_thaliana (87 

SEQIDNO_4 6__Arabidopsis_thaliana (27 

SEQIDNG_21_Petunia_hybrida (64 

SEQlDNO_2 5_Capaicum_annuuTn (67 

SEQIDNO_2 S_Arab idopsi s_thal iana { 6 4 

SEQIDNG_33__Arabidopsis_thaXiana (62 

Consensus (101) 



101 150 

VA SAHRRDSQS SIQIQP HATTSATKVSf 

TT TVHNRHVSPPPLQiQPQ PTPDPSTKESi 

NN - - - - -NDKKSBSVATP- LTTVKLs| 

TG TRTGLTDAT*l l SQQlADKKTAEIiPPVHKKEVATEQAEQS| 

KH RVQAP P PESAS API PAG AE§ 

HH RVQAP PPLSASAP| PAG AE| 

KQ RVQAPQPESFAAPVPA E§ 

PS QTRFHQQSQSLTP|P ESKNL p£ 

SD BHS PPSDKHS LS P LSDHQK 

AV QSFPLPPE-PSRASgSDHR Dj| 

WK Q I DVKQSTGS KTNHNN Hf 

VLPSPDEK NSRKIKQKISSENSS FYVf 

EGG -GGGEKR V§ 

VNKSRSLPPPPLPlSVPVTSQ INATLLE -QKNL| 

GS VNHQRSLPPPAPVMKLiHAPSS SSAAEEEKEKMVff 

DR NRDLDLPSSSSSgP LLPPLPTPI$j 

G DEDSVTVAEK§ s| 

P 
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151 200 

S EQ I DNO_Il_D at i s c a_q 1 omer at a • i LL £2y — KArS^Y:/ - G - jICA. i P222 A 4 JD QST3F AT--TM 

SEQ IDNO_l 3 _G1 y c i ne_max (86) KCSVCDKgF F SYT-A- CGHKAShRKLAG -AE DQPPS T- -TT 

SEQI DNG_1 5 _Med i c ago_s at iva \ - 5 K S| VJU3' Y - RA RICAVM§AT TAEDQ IT- -TT 

SEQIDNO 1 ?Jicot i ana tabacurr. f 2 06) K'_SV J- KAr £2 V . _ - .IKA l PIC; TTfAT- - - AASDDNNPST- - ST 

SEQIDNO^IS Orysa^sativa ( 9 G ) K C 3 V C G IS|§ F '1| S YQ22 L-G G 1 1'MiT S KB V K L P|§ P P AAPVLAPAPVAA- - LL 

SEQIDNO_3 7_Oryza_3 at iva i c i K, S V * -X$i £3. :y s 2 I 1 ;K . GHgVKLPfPP AAPVLAPAPVAA- ~ LL 

SEQIDNO 2 3 Trit icuin aeot ivuit (91} KCSV /CKSFSSYC^ 02 J K S H RV KQ pj|p PSD AAAAP LVAL PAVAA - - 1 L 

SEQIDNC_2 7_Arab idop s is_t hal iana (107) gCN^GE^FPSf QAbGG" iKA2 i-RIKFP^I STTADDS ~TA- - PT 

SE QI DNO_ 3 l_Ar ab i dop s i s_ t hal i ana ( 95 ) t£CSVrCK!3F i SYC A-..GC.-iK_ 2 2RKPVSVDV NNSNGTVTNN - - GET 

SEQIDNC_3 5_Arab idops i s_chaliana (98 } icScC^FfsiQALGGH| rSH^PTN|SI TSGNQELSNN - - SH 

SEQIDNO_3 9_Arabi dope i s_t hal iana (50) !:-CKTCNpKPDSgQALGGH^SHg|PKLIVD 

EEOlDNO_50_Arabidops i s_t hal iana (114) EC K T CN||t F^S^QAXjG G r rRA"T-"K^PRTpTEEKTRLPLTQPKSSASESGQN 

SEQI DN0_4 S_Ar ab idop s i s_t hal iana (38) |CKTCLlEF|S|QALGG3H^SKgLIw|sD P S - 

SEQIDKTO_21_Petunia_hybrida ( 99) KC0^CG^GF'."'Y2:~ ■ " " - rA^FRKLVSMGG-- - - - DEQSTTSTTT - - NV 

SEQIDNC_ 2 5 _Cap s icum_ annuum : 2 0 3 } K C:S V 2 G K r - 1 '' '2 2: Y ; .2i 1 GG. .KA2: ^LVPGG DDQSTTSTTT ■■ ■ HA 

SEQI DNO_2 9_ Ar ab idop s i s_t hal i ana (90) K C ' SVC G ?-2A 2' S 2 Y 2 • 2. 2G- :GKAG GRASPS LTQ SAGGD •- ELST- - SS 

SEQIDNO___33_Arabidopsis_thaliana (76) gCGgCYjjjfeT Fgs feAX,GG ]l| gs Figs L YGGGE 

Consensus (251) KCSVC KS FS S Y|QALGGH] KASHRK T 

SEQ ID NO 5 



SEQIDNO! IJDat i s ca_gl oraerata 
SEQIDHO_13_j31ycine_max 
SEQlDKO_15_Hedicago_sativa 
SEQIDNO__17_Micot iana_tabacum 
SEQXDNO_19_Oryzs_sativa 
SEQXDNO_3 7_Oryza_sativa 
SEQI DNO_2 3 _T r i t i cum^ae s t i vum. 
SSQI DN0_2 7_Ar abidop s i s_tha 1 iana 
S EQ IDNO_3 l__Ar abidop s i s__thal iana 
SEQXDNO_35J\rabidopsis_tha liana 
S EQ IDNO__3 9_Ar ab idops i s__t ha 1 iana 
S EQ IDN G_5 0_Arab idops i s__t hal iana 
S EQ lDNO__4 6 __Ar ab Idop sis_ thai iana 
SEQIDNO__2 l__Petunia__hybrida 
SEQIDM0_2 5_Capsicum_annuum 
S EQ I DNO_29_Ar abidop s is___tha liana 
5EQIDHO_33_Arabidopsis_J:haliana 
Consensus 



201 250 

(12 9) SATV tlT I ASGG - 2l|iHLC|I : CHf S GpfGQG LGGhf RGHllg 

(124) SSAA AjsgASG- - -jjpkHECST HK.3 PTGO - ! -GG:-. K.-.C.-1YE<3 

(124) SSAV ||§S|§ASNG - KI^HEcfelCHlK^ FPT ^ 1 . * K C"-YEG 

(148) STGA VNI§ALNP~|^fcHVCBXCHKArPT2 „2,LG22^ 221 i EG 

(142) PSAEDRE PATSgj|A|gS SDGM§Nf VHRC^l C QKE 2 PTG QALGGI-^RKH^p 

(142) PSAEDRE PATSlSApS SDGMpW^v 7 H K C £-i C QkE v PTG 0 22..- ? C - n K K; KH^^ 

(139) PSAEP---ATS^AgsSDGA|NiVKRCSI q|eF^GC - 3GEi|rKH^ 

(14 8) TSTV- -.r:H r ;H22AASGK2i:L2Sl22tvv2. pT,2.-. .: •. 2 ! K;- C- iYEG 

(13 7) IS NG LVGQi^HNCgi C FK-i r PS- 2.2 , G22K:-: C,:YDG 

(14 0) SNSGSWI^VTWTGNGVSQgl HTCSjCF^S FA|GQALGGHi^H^ 

{80) - Q EQVKHRNKENDMHKCf 1 20ti22GTG QALGGHM? KH R 72 : 

(164) SHFKVSGSALAgQAgNllNKAN|VHE2SJ J 7GSEFTi(3QALGGHMRRHRTf 

(70) LLG |LiNKKTK|g-xf H PC 2.£ C G VE F^MG Q ALGG.H MR Rh r R 3 E 

(141 ) TG-T fiANVHGN- -^HECSI 2HKC^ PTG^ALG:3i:KF-CH^ 

(144) TGTT Svi^GNGNR^^JiEC#iCHiCFpiGQ ALGGHKRgH^ 

(131) AITT ^ifGGGGGlv^HVCBCHiSFAiGQALGGHK X222BG 

(106 } NDKfTP-STAViipWC^3GiSFAi:;QALGGH|RiH^ 

(201) ST S SGKSH CSIC K F PTG^ ALGG H|KRCH YDG 



SEQIDNO_ll__Datisca_glomerata 
SEQIDM0_1 3_Glyc ine_max 
SEQIDN0_l5_Medicago_sativa 
SEQIDN0_17_Nicotiana_tabacuTU 
SEQI DNO_l 9_Ory za_s at iva 
SEQIDNO^B 7_Ory z a_sat iva 
SEQiDNO_2 3_Triticum_aestivum 
S EQI DMO__2 7_Ar ab i dop s i s__thal i ana 
SEQIDNO_3 l_Arab i dop s i s thai i ana 
SEQiDNO_3 5_Arab idop sis_thal iana 
SEQIDNO_3 9_Arab idop s i s_thal 1 ana 
SEQIDWG_5 0_Arab idop& i s_thal iana 
S EQIDNO_4 6_Arab idops i s__t hal iana 
SEQIDNO_21_Petunia_hybrida 
S E Q I DN o_ 2 5 __C apsic um_ annuum 
SSQIDNO_2 9_Arab idop sis_thal iana 
SEQI DNO_ 3 3 Ar ab i dop s i s_t ha 1 i an a 
Consensus 



SEQ ID NO 5 

251 300 

(171} S llGNS I HHHNNTTNSGSNGGMSMT|E VG^HTVSHSHEDjp^^^^lP 

(164 } NGNGN NNN S NS VVT VApEG VG SpH T V SHGHHR t : F D In v IP A I: P £) 

(16 6) SV| AGAG^GSN§VgASEGVGliSHgHHRnPDLI,XPA • PD 

(190) Kl|gn S RDLGGGGGGGHSGiVLTTSDGGASTHTLgDFp.Ii^P A2PE 

(19 2) G VgAGAGAS S T 2:^l:A277 ( il: :S:222SSGY2GSA'~'R2Fr)IJTI,PAVPE 

( 192 } GV^AGAGASST ELL^TVApEpEVG^pG2. G^cL". 2ii.2F'ZjliHt4pAV?E 

(18 6) " - .^VGAAAS S T ELLgAAAlEgEVG^ClC SS-^RAFDLNI I ) i fepB 

(191) NTLfGG GGGGSKf I SH|G|VSSiVSEERiHRGF I gpgPALPE 

(175) GN C^NfWGDNf HKFDLNL PADCjfSD ET§GK 

(190) GNM GNGNGS^NSVELVAGSDVSf^DNERW 

(119) MITEQS IVPS VVYSRP VFNRCS|S KE 1 Lg^T PfEN 

(214) VTTISPVAATAEVS - -RNSTESEIEINIG^MEQQRKYLPL|&8SlI PG i 

fill) KA2 PC TLVTRgFLPEll I VI T iKKS SSG KRVACL"JLT;$HE 2 

(181) GN-|NGNG iVSVGv|sSEGVGSTIgHH^SW86iBtt 

(187) GllNGNAN SGVfASVGV|sSEG VGSTVS » HRC FDOIXPALPE 

(174) KNiGG VSSiVSNlEDVGSpHV22.2i - KRCFDLKXPPpgg 

(144) - -G- - VSN|EGVGSHHVSS|SHi 2T,2t'?XIPf QG 

(251) G S S S SS S R [FDlSiPA| LPE 



SEQ ID NO 7 
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SEQlDNO_ll_Datisca_glomerata {221} 

SEQIDNO_13_Glycine_max {2 07} 

SEQIDNO__15_Medicago_sat iva {204} 

SEQIDNO_17__Nicotiana_t abacum {23 6 } 

SEQlDNO__lS_Oryza_sativa {2 37} 

£EQlDN0_37_Gryza_sat iva (2 37) 

SEQIDWO_2 3_Trit icura._aestivum (2 29) 

SEOlDNO_ 2 7_Arabidopsis_thal iana (233) 

SEQIDNO_31_Arabidopsis_thaliana (204) 

SEQIDNO_3 5_Arabidopa is_t hal iana (220) 

SEQIDNO_39_Arabidopsis_t hal iana (156 ) 

SEQIDNO_5 0_Arab idop s i s_t hal iana (262) 

SEQIDNO_46_Arabidopsis_thaliana (152) 

£ E QI DNO_2 1_ Petuni a_hybr i da (219) 

£EQIDNO_25_Capsicum_annuum (228 ) 

SEQrDNO_2 9_Arabidopsis_thaliana {212 ) 

SEQIDNO__33^Arabidopsis__thaliana (17 5) 

Consensus (3 01) 



301 350 
iRSNFPISG- - - -D|p^E#EHPA^EPi£i^MK- - 

gSTKVGEDE - - -!gSpi|HPVM|KPR|FVI PKIEIPQFQ 

iSKKFFVBD- - |\f f'SPLV AAK K ? C L / K LE 1 PSHY ^ - 

LQLGLSIDCGRKSQLLPMV'.'SVFSPKI AKKPRLLFSLG ' • 

fVWRPCS KG KKIYIWD E EjJjEV^S FZ<K PRLLTT A 

^VWRPCSKG KKMWDEEftEVOSPL.^ r'KK t'RLL I A 

|VWRPCAKG KMMWEDD^^Q^pAFKKP R.LLTA 

LSLHHKf • P I VDft^L^jTGplPLLL i_<i-DQV I KKED jLSLKI 

S q _' :"-t,|^ F ET|S VL - -' 

see - — saiggkrgfdlnIpadqvsvtts 

DLVLTFGKN iVPQ|DI^FVN- 

DIjR |3KFQG (VFSATPAflDCHY 

I,i VN WKjELGRT IS 

IWPGFGSGE pgE^HPAkKK RLH 1 , P P K1,E LFKGL 

gHLGFCSGE - DEVESPHPAKKS^CLPPKYELFQFI 

fcKVNG Di»MgpMFAKK^i-'DFPEKP 

|SP - D|^M^iATg|PSK 

F EBV SPL KK RLL 
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Annex 4 



Multiple amino acid sequence alignment of 2XC2H2 proteins as found in the polypeptides 
recited by their sequence database accession number in lines 17 to 30 of page 15 and the 
paragraph bridging page 15 and 16 of the specification. The position of the motifs as 
represented by SEQ ID NO, 5, 7, 8 and 9 are indicated by a box over the consensus sequence. 

With the aim to improve visualization of the Alignment, the sequences of BAA19926, 
BAA20137, BAA21919, NP_182037 5 BAA19114, CAA18741 and CAB80245 were truncated 
at their N-termmus. 



301 



AAK01713 


(1) 


AF332876 


(1) 


AAM67193 


(1) 


NP_199131 


(1) 


AF250336 


CD 


AY0 3 4 99 3 


(i) 


AY0 63 00 6 


(l) 


CAA64 82 0 


(D 


NM_102538 


(1) 


X95573 


(i) 


CAA67228 


(l) 


CAA67229 


(i) 


X98670 


(D 


X98671 


(i) 


BAC4 34 54 


(i) 


NP____196G54 


(i) 


BAA05079 


(i) 


BAA0 5 07 6 


(l) 


BAA05077 


(l) 


BAA05078 


(i) 


NP__190562 


(l) 


NP_2 01546 


(l) 


NP_188592 


(l) 


BAA19926 


(276) 


BAA20137 


(288) 


BAA21919 


(1) 


BAC43008 


(63) 


NP_182037 


(78) 


AAL76091 


(1) 


BAB67885 


(1) 


CAA6 72 3 3 


(1) 


CAA67236 


(1) 


BAA19111 


(1) 


BAA2192 0 


(1) 


CAA43111 


(1) 


BAA19110 


(1) 


BAA2192 3 


(1) 


BAB67879 


(1) 


BAA21921 


(1) 


BAA21922 


(1) 


CAA6 7 2 31 


(1) 


CAA67232 


(1) 


CAA67234 


(1) 


CAA672 3 5 


(1) 


CAB90936 


(1) 


CAB90935 


(i) 


CAA67230 


(1) 


CAB67667 


(1) 


BAA19114 


(50) 



350 

- MS S ASggE AgHAAVLKE EQQ QHEVEEATVVT S S S 

- MS S As|||E AfHAA VLKE EQQ QHEVEEATVVT S S S 

M|j|EAf|MSPR LVEDPL 

Mg|EA|jNS PR LVEDPL 



- -LFEDSS 
--LFEDSS 

- -LFEDSS 

- - LFEDSS 
- -LFEDSS 

- - LFEDSS 

- ™ LFEDSS 

- - LFEDSS 

- - LFEDSS 

- - LFEDSS 
- -LFQDSA 

LFQDSA 
--QFENNG 
- PRYEDQV 

- M|gEAfjjKS PTAATPSLP PRYEDHV 

- MT|Et|kS S S TPKTS KP TIPLPPK 

-m^dt|nsptsttttt- -apppflrcldet 

- m^etpsjs ptatttar p llryreem 

" "|ea|ntptssftrietkedlmndavfie 

" DEJFHRNGNYQWNTSNTSLGCWFDESGP 

qsessdsayfleenaivesdvHdgfgimgnskwitskmskaawcdesrt 

mdllqdresetesjfpyptqckrykrx j1sfsrjsdthynq 

ledg es es eslrnvgstltrkrs krtrkld s f vtkkvkts 

lddele fdfaeddd ve s etesbr - - bnptrrrs krtr klg s fdfdfe klt 



Mf||E AgTS PRLAS PIP - - 

^AJgEA§T3 PRLAS PIP- - 

M^E A§TS PRLAS PIP-- 

y. ALE AjjTS PRLAS PI P - 

M&gEAgTS PRLAS PIP-- 

M^EA§T S PRLAS PIP-- 

— M H EA I T s PRL AS pip-- 

MliEAliTS PRLAS P I P 

MlpSAfT S PRLAS PIP- - 

"•"AllE^TSPRLASPIP- - 

M P Ert l T SPRI J SSPMP- - 

m||||et||t s prl s s pm p - - 

mKeaInspttttpps - - 

IeaIks ptaatptlp - 



QSENSDSEYFLGEYKKVESD. 



- - MGQDEVGS DQTQ 1 1 KGKRTKRQR S S S TF WTAATT VTS T S S SAGG 

- - MGQDE VGS DQTQ 1 1 KG KRTKRQR S S S TF WTAATT VTS T S S SAGG 
-MEALEEWGVPNKDHFHl|KGKRTKRLRLQSPIPFTVAATHDSSNA 

Me|qMQEDHDHHMNMVI KRRRTKRPRPS SP 

MEFSEDSI 



dddqasdwedkfggsvkegnkr|yq|rtnpnrqksmrvcencgkefsswk 
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BAA2 1924 


{ 1 


BAA2 ±92 5 


( 1 


BAA2192 6 


(1 


CAA18 741 


(50 


CAB 8 0 2 4 5 


(50 


Consensus 


(301 



SSSS S SPKfgAS KPKYTKRPD PNAPKI TRP C TECGRKF 

ssss sspk§askpkytkkpdpnapkitrpctecgrkf 

AL L 



AAKU _L / X 3 


( 3 b 


Ar ii^o / 6 


( 3 5 


AAM6 7 19 3 


(17 


im p iyyj.ii 


(17 


Air zbUiio 


( 2 4 


Ax U 34 9 9 a 


t 2 4 


Ax Q6 3 0 06 


(24 


CAA64 82 0 


(24 




(24 


A9 b b / 3 


(24 


L.i\A.b / jL £ o 


( 2 4 


LAA b / z A y 


(24 


Ay o b / u 


(24 


Ay o b /i 


(24 


BA.L-4 J4i)4: 


(24 


J>iF ± y o U b 4 


(24 


J3i\A UDU / y 


(24 


dAAu b 0 / o 


(25 




/AC 

( 2 b 


13AAU bU / o 


(25 


"kt n i n a im 

NP 1 y 0 5 5 2 


(29 


IMF zu lb4b 


( 2 o 


KTT1 "1 O O C O 

.NP 1 a b b 9 2 


( 3 1 


BAAiy y 2 o 


(326 


J3AA2 Ulb / 


{ *3 "3 O 

( 3 3 o 


IDAAOI £1*1 & 

t$pj\z iyiy 


( 3 y 


Tj AH/l -A n A Q 

hSjH.L-4 3 U U b 


fin*} 
( 1 U Z 


IMF 1 d 2 U 3 / 


(12 6 


aaij / o u y i 


{ 1 


rJ JAri b /bob 


f A 

( 1 


LAAd / 2 3 3 


(46 


LAAb / z 3 b 


( 4 b 


bAAl y 11 1 


(47 


r)7\ Tv A 1 n A A 

BAA 2 1 y 2 0 


( 3 1 


CAA4 3 111 


( 9 


.bAAl y 110 


( 1 


BAA2 lyzi 


/ A 


n 7i n /~ n d *"7 ("j 

BAB 6 7 87 y 


( 1 


TA7\ A A 1 t — , - — i "I 

BAA2 1921 


f A 


BAA2 1922 


(1 


CAA67231 


(1 


CAA6 7 2 3 2 


(1 


CAA6 7 2 34 


(1 


CAA67235 


(1 


CAB90936 


(1 


CAB90935 


(1 


CAA6 7 2 3 0 


(1 


CAB67667 


(1 


BAA19114 


(100 


BAA21924 


(1 


BAA21925 


(1 


BAA2192 6 


(1 


CAA18 741 


(87 



351 

ATSGEEGGHLPQG W||KRXRS - -PRQR 

ATSGEEGGHLPQG W|KRKRS- -RRQR 

RFNGVEQWTKCKK Rf|KRSRSDLHHNHR 

R FNGVE Q WTKCKK R|KRSRSDLHHNKR 

VFHGVEH WTKGK R§ KR 5 R S D FHHQN 

VFHGVEHWTKGK r|kRSRSDFHHQN 

VFHGVEH WTKGK R|KRS R SDF HHQN 

VFHGVEHWTKGK R|KRSRSD FHHQN ™ - - - - 

VFHGVEHWTKGK RjKRSRSD FHHQN 

VFHGVE HWTKGK r|kRS R S D FHHQN 

VFHGVEHWTKGK r|kRSRSDFHHQN 

VFHGVEHWTKGK R|KRSRSDFHHQN 

VFHGVEHWTKGK r|kRSRSDFHHQN 

VFHGVE HWTKG K r|kRSRSD FHHQN 

LG FHGSKGK R|KRSRSEFDRQS 

LG FHGSKGK r|KRSRSEFDRQS 

LKY~ LE S WTKGK- R|KRQRS MERQ 

dmsnldswvkgk r|krprietppss 

DMNNLDSWVKGK r|kRPRIETPPSE 

PINDAIDIHKRK r|kRPRIETPPSE 

EPENLESWTKRK RTKRHRIDQPNPP 

E PENLE Q WAKRK RT KRQR FDHGHQNQETN - 

PWLKRKRS KRQRS H|P SSSSSSPPRSRPKS™ 

AEKKELTRMKNYV TDSRKDLSKDYKYDSYG- 

HKE KGLNR1SF KRNT IDSSKD1 TEECEDDDYW - 

FLSLERRRQQQQQQ YGKITEFPFVESEP 

QLGYKPESDQEPP HgSASDTTTEEDLAFCL- 

TSQPSELVAEPEH H|SASDTTTEEDLAFCL- 

MT1TREEAESK 



400 



SGGERAVSDEYNS AVSSPVTTDCTQEEE 

SGGERAVSDEYHS AVSSPVTTDCTQEEE 

CDYHKNDTDNGEDVPHNNMNNNfDNNASPSSAPSAEMFIN 

L ALT I ATS S C S T VEGTHAG E LDG HVAN SSSSPSNSG 

DHTLVFKGKRSKRPRQLSPDIY|SSTTSTTQISSSS 

, , -MVALSTKREREE 

MWLPLKRSRE 

MANCPSQNATPAjRRAPIAMSKRSRS 

f MSAMKRSR--E 

MSSIKRSRSEE 

MVAISE 

MVAISE 

MVARSEE 

MVARSEE 

MVARSEE 

MVARSE 

RSDFEES 

MKRDRSDYEES 

SFLEHGKCSSEDAEESLVSSPG|EGEDYIYDGRKEKGYGWSKR.KRSLRTK 

MTIKKRSREE 

MTIKKRSREE 

MALEIRSRSEEAK- 

WSWKALFGHMR-C H PERQWRG I NP P PNYRV 
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CAB80245 (87) WSWKALFGHMR-C HPERQWRGINPPPNYRV 

Consensus (351) S 1 

SEQ ID NO 8 

401 450 

AAK0 1713 (59) i^gN^L^^^^GGHKR 

A F 3 3 2 8 7 6 (59) $E3BN^AL CLLMLARGGHHR 

AAM 67193 (44) LfEE E Y LA FCLML LARD GGD 

NP 199131 (44) 1 :T'E£ E Y L A F CLMLLARR GGD 

AF2 5 0 3 3 6 (49) LTEEEVLAFCLMLLARDNRQ 

AY 0 3 4 9 9 S (49) L£E3 E YL A h' CI. Ml. LARDNR Q 

AY 0 6 3 0 0 5 (49) LTEEE ± LA F CLMLLARDNRQ 

CAA64320 (49) LTESE ■ LAKCL.M.LLARDNRQ 

NM_J 025.3 8 (49) LTEjSE YLAFCLMLLAJRDNRQ 

X95573 (49} I_i §J jpi^ § x r! i|;|l F Jj ^J^jE^OMP. Q 

CAA67228 (49) LTEBE Y LAFXLMLLAR DM R Q ~ 

C AA6 7 2 2 9 {49} L$EEEY LA F ,f . /LMLLARXFR Q 

X9867 0 (49) L|EBEYLAF/CLMLLARDN"RQ 

X98 671 (49) ~l!|^y|§FXLMLLARDNRQ 

B AC4 3 4 5 4 (46) * I A ! .CLMLLAR DGDRNRDLDLP 

NP__1 9 6 0 S 4 (46) L jE|EY||ALC^HLLAR .. ll^MRDLDLP 

BAA0 5079 (46) C^^Y^^g^OTLARSDGSVI^SRSL 

BAA0 5076 (50) -EE"LA : .C1»JEW|LARSGNGTTPS S IP 

BA AO 5077 (50) KYB L ? J 4^ L ^ RSGNGrTPGSTD " - - - 

BAA0 5 0 78 (50) K S FLAXCL XMLAR9 G - GK11PTTTP - - 

11F 190 5 62 (54) Pgf ^'Yg.LCLLML ARG SSDK 

NP_201546 (55) KNLP^^^^'LClLMLARGSAVQ 

NP^ 18 8 5 92 (61) Q NQDL^^YLAT .C L LML AjDQP S QTRFHQQ 

BAA1992 6 (356) MVSHL j|KR E S R KRpKDS j Y P PNL S NET FKN 

BAA 2 0137 (368) LSSYE§KCEPRKRfiRDs|jYHPELGNES YKK 

BAA2 19 1 9 ( 67 ) VSS ISDTfjP j8f D^N^^p^gDKWMTQEHEVI 

BAG 4 3008 (132) MMLSRj|KWKKNKSNKE jvEE IETEEE 

1STP_1 82037 (156) IMLSRgpCWKQQKKKKQRVEEDETDHD 

AAL 76091 (12) EME§LRVHA||AL§SgspPAAS AS 

BAB 67885 (1) MEgLRVHAgALlSf sftpAAS AS 

CAA67233 (74) DMAICLIM^RG^pPgPDLKNSRKIH 

CAA6 72 3 6 (74) DMA1 CL 1 MLAROTVLPS PBLXN S RKIH 

BAA1 9111 (87) TFTE^gET|RY|^^SCDDHLHPRNKSRDGLPPDKFEL 

BAA21920 (67) — - ~ - IDILIRKF EBEIjMA MCYILLA,, mWNQKPSP S • - 

CAA4 3 111 (45) SREg^DMAlTCLILLAQSGQSHKQKFSS 

BAA19110 (13) DNFYS ITT^NY|p^g|QAN"EHFDK 

8 AA 2 1 92 3 (12) AEFK.S I TTg&N LML r 3HQEN - HFNT 

BAB 67879 (27) MWDMQEFVGS VDTAP VLMLL&QQSQHGLLGGGG 

BAA21921 (10) BRQViAAAppiCLMLLS|LKDKSTST - 

BAA 2 1 92 2 ( 12 ) YGOVgAiA^KCLMLLsf LNDHNTS 

CAA6 7231 ( 7 ) IKS|\^TAfNC2M-,^SRVGOEMVD 

CAA6 72 3 2 (7) 7 ?CS XVD VT ^ Aj vCLKLLSRVGQEN'VD 

CAA6 7 2 34 ( 8 ) 1 V I Vg^^TTALYCLKLLSRV GECGGG - - 

CAA6 7 2 3 5 ( 8 ) IVIVJ^TTAF.CLMLLSRVGECGGG 

CAB 9 0 9 3 6 ( 8 ) IVIVjjpfT j 'A i< CLML LSR YGECGGG 

CAB 90 9 3 5 (7) EVEIVg|TAA^CLMLI,SRVGECGG 

CAA6 72 3 0 ( 8 ) L KN I jjj) I AKC L M I jjA QTjMVKQ I GLNQ 

CAB 6 7667 (12) MKH1 jl VES LHM LSR S F WKQ ID VKQ 

BAA1 9114 (ISO) VGGLSTSTYQS SEEjBLL^K^^JpNARVDTSLVE PEES CAS ASREEE 

BAA2 192 4 (11) - - - LAKV^TQA| AXCUY I LEONTWL AR 

BAA 2 1.92 5 (11) LAKv|tIJ^N^nBe^TSL^ 

BAA2 192 6 (14) S E VSVEP LA^EN CAXTLQ' RNQLLGES - 

CAA1 8 7 4 1 (116) PTAAlSKQLNQILPNWjpFMSEEDHEVASC 

CAB8 02 4 5 (116) PTAAjfS KQD NQI LPNW|jp FMS EEDHE VAS C 

Consensus (4 01) TE pl" LA CLMLL} aR 

SEQ ID NO 9 
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451 



AAK01713 
AF332876 
AAM67193 

NP___199131 
AF2 503 36 
AY034998 
AYO63G06 
CAAS4820 

1TM_102538 
X95573 
CAA67228 
CAA67229 
X9S670 
X98671 
BAC43454 

NP__196G54 
BAA05079 
BAA05076 
BAA05077 
BAA0 5 07 8 

MP_190562 

NP_2Q1546 

NP__188592 
BAA19 92 6 
BAA20137 
BAA21919 
BAC4300B 

NP__182Q37 
AAL76G91 
BAB67885 
CAA67233 
CAA6723 6 
BAA19111 
BAA2 192 0 
CAA43111 
BAA19110 
BAA2192 3 
BAB67879 
BAA21921 
BAA21922 
CAA67231 
CAA67232 
CAA67234 
CAA67235 
CAB90936 
CAB9 0 93 5 
CAA67230 
CAB67667 
BAA19114 
BAA21924 
BAA2192 5 
BAA2192 6 
CAA18741 
CAB80245 

Consensus 



(79 
{79 
(64 
(64 
(69 
(69 
(69 
(69 
(69 
(69 
(69 
(69 
(69 
(69 
(73 
(73 
(73 
(74 
(74 
(73 
(75 
(79 
(92 
(386 
(398 
(100 
(158 
(182 
(35 
(23 
(101 
(101 
(127 
(99 
(73 
(39 
(37 
(60 
(36 
(37 
(32 
(32 
(33 
(33 
(33 
(31 
(34 
(38 
(200 
(35 
(35 
(41 
(146 
(146 
(451 



500 

- -VQA- -PPPLSASAPPPAG AE^CS VCGSS Fg|§QAL 

- - VQA — PPPLSASAPPPAG AEg§CSVCGK^ r'SoYQAL 

LDSVTVEHKP Sf gCGV C Y KT i-'gsMQ ^ . 

LDS VTVAE KP SfgCGVC YpT F 1S§QAL 

PPPPPAVEKL S M| CS VCDiTFiilQAI 

pppp PAVE KL SgCSVC DKi ' F Sg YQA1 

PPPPPAVEKL Sf|c SVCD.fel F 1 

PPPP PAVE KL S |||C S VCD|CT F |g '^QAL 

PPPPPAVEKL S^geS VCDiTFiifQAL 

PP PPPAVE KL SllCSVCDil F§|§|QAL 

PPPPPAVEKL s ll§ CS VCDKTF$aSQAL 

PPPPPAVEKL SyfCSVCDiTFiliQAL 

PPPPPAVEKL SXfcs VCDKTFgg^QAL 

PPPPPAVEKL S^CSVCrDKTFSgjQAL 

S5S33- -PPLLPPLPTP li|CSVCDEAFp|^QAL 

PPLLPPLPTP I M§C S VCDigAF #SQ AL 

PPSVPVTSQINATLLEQKNL^CSVCC-KGF "SY;QAL 
KEPEKKNRDVAPVYQETEQS YKCS v :"-':.jXS FSS^QAL 
KEPEKKNRELTPVHQETEQsjT" 
QVQE PINKPLQVQE P INEQS| 
SPPSDHHSLSPLSDKQK- -D| 

PPLPPL PSRAS PSDKR D§ 

SLTPPPESKN LP| 

KSPEGSKHTQ - KKKF 



lCSVCD|SF^QAL 
|CNVCNI|S FHgfQAL 
|CSVCGiSFp|gQA] ) 
§CT VCGfS FSSYQAi. 
ICKF/CEKAFPSYQAL 
£F C L M C K K T rSS YQAh 



-KGSEGCKKIH- 



- -NKKKf ECLNCKK 1 KGSYO-',; , 
-RGRGK|l CETCNKVFRSYQAL 

- -TKGR^CETCGKVFKSYQAL 

- SHGRBCETCGK\ 'i'SYQAL 
-TEGV§ECKTCS$RFP^QAL 
- TEG vj§E CKTC S KP: F F SFQ AL 



sssss 

PPPPL 

GSTDTTTIS 

TTITTTIS 

TT1THEPL 

S 

SQ- - 

VKPSF 

MKLSH 

DNSASYDEDVKTEDSWVKVTTTR- 

3--EGYNKI1STQAT 

S - -EDYKSS K 

QPTSSSST 

QPTSSSST 

QKISSENS SFYVKECKTCNfTF^pQAL 

QKXs SENS S F YVf ECKTCNf T F||gQAL 

FNDDLKL YQS KFNS KR Y T ETSTNLGNGTKAG I F vj§ ECKTCNjfT F pjjfQAL 

HS PLD V|QCKTGNpC F PtjfQ AL 

RKFTETATSTGKAGFYvf ECKTCN^TFP^QAL 

KMN3SIS S TS RVpE CKTCNl-QBlll |§QAL 

MMDNS PSRVJeCKTCNJQFI^QAL 

FAAGAQP VWRGGAHDRVpE CKTCNgQFPgQAL 

TTTNQD -HHNDfECKTCN'lRFgliQAL 

KNQD HHNEgE C KTCNK. : ? F ' - ' S |QAL 

GGD QKRV§TCKTCL|fQFH§gQAL 

GGD QKRVjTCKTCLifQFH^QAL 

COG DERVjpiCKTCLlCLFSSiQAL 

CGG DERV^CKTCLK - FSSjQAL 

CGG DERVppCKTCLKL FSSS?QAL 

GG EKRV^CKTCLKEFSSrOAL 

HTESHT SNQ|ECKTCNKRF£S£GAL 

STGSKTN HNNh|eCKTCHJKFD^QAL 

R AARHS MAYG FT PL VS TRVPFDNKAKGAS - SKGLJeCKACK^VFN^QAL 

ki|ecktckKqfd^qal 

Kl|ECKTCX;igQFDp|QAL 

s SKlfECKTC : KKQ ^d|§|QAL 

LLMLS - -NGTPSS S S IERiECGGCK|vFG^QAL 

LLMLS - -NGTPSS S S IER|E;CGGCKjVFG^pAL 

YKC C K FSSY[QAL| 



SEQ ID NO 5-firstpart 
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AAK01713 (113) GGHgTM|VKLPT PPAAP 

AF3 3 2 876 {113} GGH^T'g^VKLPT PPAAP 

AAM67193 (91) GGHp^^S LYGG 

NP_1 9 9131 (91) GGHltASHRG ■■ - LYGG 

AF2 503 36 (96) GGH^^^NLSQ TLSGG 

AY034 9 93 (96) GGH^^^^NLSQ TLSGG 

AY06 3 C06 (95) GG H KA S B R KN L S Q TLSGG 

CAA64 82 0 (96) GGh|cASHRKMLSQ TLSGG 

NM 10 2 5 3 3 (96) GGHfeASHRKNLS Q TLSGG 

X95573 (96) G G B KAS HR KN L S Q TLSGG 

CAA6 7 228 (96) GGBKABHRKNLSQ - TLSGG 

CAA6 7 2 2 9 ( 96 ) G GH KA3 H H K3nT L S Q TLSGG 

X 9 8 6 7 0 (96) GGH&ASHRKNLSQ TLSGG 

X 9 8 6 7 1 (96) GG'HSjvAS HRKlvLSQ TLSGG 

BAG 4 3 4 54 (10 5) GGH^gSigSFSL TQSAG 

NP_1 96054 (105) GGB^B^^S FSL TQSAG 

BAAO 5 0 7 9 (114) GGHKASHRK LVSMG 

BAA0 5 07 6 (119) GGHgASHRK 7 TT1 ATTAL 

BAAO 5 0 7 7 (118) GGH§ASI-ffi KITTI ATTAL 

BAA05078 (117) GGH|^^NKNLS TTTVS 

NP__190 5 62 (110) GGH-K SHRKPVSV DVNNS 

MP__2 01546 (113) GC i li I SHRKPTNT S ITSG 

NP_18 85 92 (12 2) GGH^g^IKPPT VIST 

BAA1 9 92 6 (421) G G Hj| P C MK l I T N A YFESTYETGENSRD AD1M G PN Y I N KG KH RETLSNKP AAH 

BAA2 013 7 (433) jGGHgPC|^ANS YVE S INGTGENS LDADHDG KPFSA 

BAA21919 (145) GGHj^^Sl 1 ^ 3 

BAG 4 3 0 08 (18 9) GGHiRASHKKNRVSlJXKTEQR ■ 

NP__1 82037 (210) GGH|giESP^ CMTKrrE Q 

AAL7 6 0 91 (63) GGH1t|§T| LQAKL 

BAB 6 7 8 8 5 (51) GGilf l|§T j LQAKL 

CAA6 72 3 3 (12 9) GGH^gHg^PRTSTEEKTRLPLTQPKSSASEEG QJSISHFKV 

GAA6 72 3 6 (12 9) GGH3|^^g|PRTSTEEKTRLPLT0PKSSASEEG QKTSHFKV 

BAA1 9111 (177) GGURASSi KKPKTLTTELVNNKK LYFDFS 

BAA2 192 0 (121) GG : i RASHKK? KL P TNLEE KNS KP I EHVENCS KSHEDHVTTL S L Q I SNNNT 

CAA4 3111 (106) GGHR" 3 H'KK-S KT I AAE KT S TLEDHHQQQE R VAQ EEGEFXKI IPS 

BAA19110 (65) GGMMA3.HKKPRL M 

BAA21923 (62) GGIiBAS HiCKP R L M 

BAB67 8 7 9 ( 94 ) GGHRA3HKRPRQQ QQ KALGGG 

BAA. 2 1921 (62) GGHRASH KRPK 

BAA2 19 2 2 (61) G C \ G-|§^fe|TK 

CAA6 7 2 3 1 ( 5 5 ) 3' GI^^^^PN 

CAA6 7 2 3 2 (5 5) GGi I1^BH|PN 

CAA6 7 2 3 4 (56) GG H g|Nf^ L IW S 

CAA6 7 2 3 5 (56) GGHRASHKK L IN ~ S 

CAB 9 0 93 6 (56) GGHRASHKKL IN S 

CAB90935 (53) 3 i^RlIN S 

CAA672 3C (59) GGH^0g^gP<:., T 

GAB6 7 6 67 (65) GGHRASHKKPKL - 1 

BAA1 9114 (249) GGH^Sg&VKG CYAA 

BAA219 24 (53) GGHRASHXKPK F 

BAA 2 19 2 5 (53) GGHgASHKK PR F 

BAA2 1 9 2 6 (61) GGHR ."SiJKILRN KL 

GAA18 74 1 (17 8 ) GGHftASHKlGVKGCFAITKrVTDDP MT 

CABS 0 245 (17 8) pGHg gl^WKGGFA IT RVTPP P MT 

Consensus (501) |GGH] RASHKK 



SEQ ID NO 5-cont 



42 



Ann Isabel SANZ MOLINERO 
Appl. No. 10/537,897 
Atty. Ref.: 4982-5 
Amendment 
November 18, 2008 



AAKO 1713 


(131) 


AF3 3 2 67 6 


(131) 


AAM67193 


(10 4) 


NP 199131 


( 104 ) 


AF25 03 36 


( 114 ) 


AY0 3 4 99 8 


(114) 


AYO 6 3 0 06 


( 114 ) 


CAA64 82 0 


(114) 


MM_1 02 5 3 8 


( 114 ) 


X 9 5 5 7 3 


(114 ) 


CAA6 7 2 2 8 


(114) 


CAA6 7 2 2 9 


( 114 ) 


v r*i o <~j 

X9 8 6 7 0 


{ 114 ) 


X 9 8 6 7 1 


{ 114 ) 


BAC4 3 4 54 


( 123 } 


MP__1 9 6 05 4 


( 12 3 ) 


BAA0 5 07 9 


( 12 8 ) 


BAA0 5 07 6 


(13 7) 


BAA0 5 07 7 


(13 6) 


t~i "Ti. t\ r\ r r\ n ri 

BAA0 5 0 7 8 


(13 5) 


NP__1 9 05 62 


( 1 2 8 ) 


IN P z U 1 b 4 t> 


\ 1 3 1 ) 


KTP 1885 92 


(13 9) 


BM1 9 9 2 6 


(471) 


B AA2 013/ 


(469) 


BAA2 1919 


(158 ) 


BAC4 3 0 0 8 


(209) 


NP_18 2 0 3 7 


(229) 


7\ ti t n i^ 1 A n i 

AAL / 6 0 9 1 


( 77 ) 


BAB 6 7 8 8 5 


(65) 


CAA67 2 3 3 


( 16 9 ) 


CAA6 / 2 3 6 


(16 9) 


BAA1 9111 


(2 05) 


BAA2 ±92 0 


(171) 


CAA4 3111 


(15 0) 


BAA1 9110 


( 78 ) 


BAA2 ly^i 


(75) 


BAB 6 7 8 7 9 


(115) 


BAA2 19 2 1 


( 7 j ) 


BAA2 19 22 


(72 ) 


CAA6 72 31 


(66 ) 


CAA6 7 2 3 2 


{ 66 ) 


CAA6 7 2 34 


f f n > 


CAA6 /23b 


I 6 9 } 


C,ABy 0 93 6 


(69} 


CAB 9 0 9 3 5 


(66) 


/"I 71 TV /" ry n n n 

CAA6 7 2 3 0 


(72 ) 


LABo /bo/ 


( 78 ) 


BAA 1 y 1 1 4 


( z b 5 } 


BAA21924 


(65) 


BAA21925 


(65) 


BAA21926 


(75) 


CAA187 41 


(203) 


CAB80245 


(203) 


Consensus 


(551) 



551 600 
VLAPAPVAALLPSAEDREPATSSTAASSDGMTH^7gRCi|CQlE s F|^GQA 
VLAPAP VAALLPS AEDRE P ATS S TAAS SDGMTI IRWH CST 2 QKH F PTG : A 

GDNDKS TP S||A V .gSg VCgCQgSlF ApOQA 

GENDKSTPs|a V KSgVCgf CGpSFA||GQA 

GDDHSTSSAfTTSA VTTGSG %SM VCTj;CM^S PpgCQA 

GDDHSTSSAfTTSA VTTGSG ^SUVCf TGN^S FPlGQA 

GDDHSTSSAFTTSA VTTG S G KSf VCggCNiS F^GQA 

GDDH S TS S A|TT S A VTTGS G jspVCB CNiSFg^GQA 

GDDHSTSSa|TTSA VTTGSG &VC§f CN|s ; ?f |gQA 

GDDHSTS S a|tT S A VTTG S G KSHVCTTCNKS FIG Q A 

GDDHSTSS.AHTTSA VTTGSG KSH VCTIG^KS PSCQA 

GDDHSTSSAFTTSA VTTGSG jspVC^p.CN|S F^jGQA 

GDDHSTS SApTTS A VTTGSG If Sl^/cif CXgS Fg|GQA 

GDDHS T SSAlTTSA VTTGSG KS jfVC^CISr js F^GQ A 

GDELSTSSA1TTSG I SGGGGGSV™ - KSiJVC^CHjsFApGQA 

GDELSTSSAITTSG ISGGGGGSV - •• KSfiVC :|l:CH||S FA|GCA 

GDEQSTTS T|TNVT GTS S ANVNGNGiTfiE c||CHlCF;i|GQ A 

LDDNN3SINPT|SNS TNG NVVUN I S TLNP3G jf S HVC S X CHKA F PSGQ A 

LDDNNjMNPT|SNSTSG NVVKMISALNPSG|sMvcg|CHpAF^GQA 

YDDTN - - PS JSNSLN PSGR^H^G^iCHjCFSgGQA 

NGTVTNNGW1 SNG L VGQSGKTHNC S I C F Fg|§GQA 

NQELSNNSH§!SrSGSW- - IMVTVT^TGNG VS Q SG;K 1 Htc||||c FpSFA^GQA 

TADDSTAPTI S IV AG EKHPIAASG|lgEcfecHiVF|gGQA 

SHDY S SNPE KKMKP KKFICGISaCP F C pJmFkJgQA 

VKEPSYNPEKKIKP KKVjEG ti E C PY G DpVlF K^G QA 

- - INETKNNGNVES E VQ K.G:K 1 MEC P jCY jvFS JGQA 

SETEYDNVW VAk|iHkC =TCl|vFA|G0A 

VETEYVLGVK EKK- V§EC P|C F|VF1||gQA 

LSDPAAAAAAAAER DRAgVgEGAgCGVEFSMGQA 

LS DPAAAAAAAAER DRA|v!|ECA|CGVE : FSMGQA 

SGSALASQA§KI IN KANKVHEC^CGS E'FT^GQA 

SGSALASQA§NIIN 2CAI1K^7HEC^CGSE|FTJgQA 

DEDDQPSPSiTLCKTNKDWRILPNSSNKYTSPil|EC|YCGAE|FT§GQA 

NNNNSNNNNNNNI IK NK^ViEC^CGAEjFTp5QA 

ISTQI INKGNNMQSNFN" S KS iK; I j E C A 1 CGAE|F T'pG Q A 

GELHNLQLFHE - - L PKKKTHECgl CGLE|FAIGQA 

GELN - FQLP|S - P P KP^r|[ECg|CGL EjFjP- 1 GQ A 

AGADDAGLCLGRQP TPPRPQPAKP^V^ECPpCGLEF jlGQA 

-LLIGAGEFLVQPS SK-iM|EC§f CGMEFSLGQA 

- VLTG AGE F LAQQA KKN||M jEC^f CGMEF S LGQA 

• MDAL S S GLMKK VKTS s|pC P jCGVEF gMGQ A 

- NDAL S S GLMKK VKTS s|p C 1 ■ I C C v V E ? P>1 G OA 

DNPSLLGSLpNK KTKTs|pCp|cGVKFf MGQA 

DNPSLLGSLgNK KTKTS§P€P§CGVKF|MGQA 

DNP SLLGS L|NK KTKTSIpC P§CG VKf|mGQA 

SDPSLLGSLgNKKT KTATSfPCPX CGVEF§MGQA 

VEQKDVKHL|ND-Y KGNHF-lKCitCSG^SFGlGQA 

VDQEQVKHPJ^ KENDMlRc|§CDQMFG jGQA 

KQDQLDD I L I DDQDVN I THDQE FLQSS KSMRKSg I §E Ci|cHiV FS§GQ A 

ITAADFSIE|SSY - -ECiFCGEDF^GQA 

I TAADFS I GpPNYK PNI^VfECf F CGEDF^GQ A 

LTSLPGNDQLPVKT % KH E Cjjjf C GE Q L LGQ A 

VSTSSGHDHQGKIL TFSGH|KCn|;CF|VFS|GQA 

VSTS S GHDHQGKIL TPS Gh|kGF|C f|ws|gQA 

t k h CSIC K FPSGiQA] 
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AAKO 1713 


(181) 


AF3 3 2 876 


(181) 


AAM6 7193 


( 133 ) 


NP_19913 1 


( 133 ) 


AF2 5 03 3 6 


( 15 1 ) 


AY034 998 


(151) 


AY063 006 


{151 } 


CAA6 4 82 0 


( 151 } 


T.TH jr -I «"\ 1— -\ l"\ 

NM_1 0 2 5 3 8 


( 151 ) 


X95573 


( 151 ) 


CAA672 2 8 


(151) 


CAA6 7 2 2 9 


( 151 ) 


X98670 


(151) 


X98671 


(151) 


BAC4 3 45 4 


( 163 ) 


ISI'P_19 6 05 4 


(163 ) 


BAA0 5 0 7 9 


(17 0) 


BAA0 5 0 7 6 


( 183 ) 


B.AA0 5 07 7 


( 182 ) 


BAA05 07 8 


( 168 } 


NP 190 5 62 


( 164 ) 


NP 2 015 4 6 


(17 9) 


NP_18 8 5 92 


(18 0) 


BAA19 92 6 


(505) 


BAA2 013 7 


( 503 ) 


BAA2 1919 


( 1 9 2 ) 


BAC4 3 0 0 8 


(239) 


ISfP 182037 


(258) 


AAL 7 6 0 91 


(111) 


BAB 6 7 8 8 5 


( 99 ) 


CAA6 7 2 3 3 


(203) 


CAA672 3 6 


(203) 


BAA19111 


(255 ) 


BAA2 192 0 


(206) 


C AA4 3111 


( 187 ) 


BAA1 9110 


(110) 


BAA2 192 3 


(10 6) 


BAB 67 87 9 


(15 6) 


BAA2 192 1 


(105) 


BAA2 192 2 


(105) 


CAA6 7 2 31 


(97) 


CAA67 2 32 


(97) 


CAA672 34 


(101) 


CAA6 72 3 5 


(101) 


CAB 90 93 6 


(101) 


CAB 90 93 5 


(10 0) 


CAA6 7 2 3 0 


(10 5) 


CAB 6 7 66 7 


(10 8) 


BAA1 9 114 


( 3 15 ) 


BAA21924 


(92) 


BAA21925 


(99) 


BAA2192 6 


(106) 


CAA18 741 


(237) 


CAB80245 


(237) 


Consensus 


(601) 



601 

P^GGH|!R§H^pGVGA 
jLGGHKRKi iYDGGVGA 
LGGKgRCH^GGVS - 
LGGHf|RCHf§§f GVS - 
LGGH|ECK| 
LGGHjgRCHj 
LGGHftRCH] 
LGGHgRC 
LGGHlRCHi 




LGGHgRCH^P^NNN 

LGGHtPCUiiGMMM 

LGGH|RCH|ii]>fNN 



650 

GAGAS 3TEL LATVAAE S E§§G S S GNG Q S 

GAGAS ST EL LATVAAE S Ej§G S S GNGQ S 

NSE GVGS|SH§SSSS 

NSE GVG S|:SH§f8SSS 

•NNN IHTSSVSNSEGAGS|SBlSSS 

■NUN INTS SVSKSBGAGSiSH|SS S 

rNNM INTSSVSlSrSEGAGS§SH§SSS 

'NN INTSSVSHSEGAGSlSHiSSS 

:MNM INTSSVSNS EGAG S§SH§SSS 

IMT S S VSMS EGAGSjSHgS S S 

INTS3VSNSEGAGS§SHf§SSS 

IMTSSYSNSEGAGSlSHiSSS 



LGGHlJpRCH] 
LGGH§RCHj 
LGGKpRCH 
IsGGH^RCH; 
LGGH(RCHf 

lgghIrP 

LGGHgl^ 
LGGHlRliHi 





m ^ IMTSSVSNSEGAGSiSHfSSS 

gNNN INTS S VSNSEGAGS|SH|SS S 

IfKNGG GVSSSVSNSEDVGS|SH|SSG 

fKNGG GVSSSVSNSEDVGS(SH|SSG 

|GWG NGNGSVS VGVTSiEG|GST ISH 

pCLGG NNNNNHRDGGGHS GSWTTSDGGAS TH 

3KLGG NHNN - HRDGGGKS G S VVTTS DGGASTH 

|NLGG GVSR™ GD - » Jvi SSEGGGSAV 

LGGK§RCK^fgGN GNSNCD NSH 

LGGHfRCK^fGNN- GNGNGS SSN 

!LGGH.|RCHSf NLGGGG GGGSKSISHSGSV|ST|SEERS - - - 

LGGHgRSHf IVSSE- • -NKYQASAVQGK 

LGGK JR SHflgGSFR NLMQ S SAAKKE 

LGGHg;PSHGljvAATN VSLSTKIVSSRI 

LGG MfeSHGV^N LSVN QQRRVHRNESV 

^GGH§RSHGSNIGAGRG LSVS QlgQlEEEVSV 

LGG HMRiHRC ETGTTTV VLADADDS G GATVPQP - 

LGGHMRjjHRGETGTTTV VLADADDS G GATVPQP - 

1... GGHMP§HRT§VTT ISP VAAT AE VSRNS TEE E I EINpGR S MEQQR 

LGGRMRfHRTiYTT ISP VAATAEVSRHS TEEE I E IMjGRSPlEQQR 

LGGKMR|HRGfVN VN SS LHL SNYVS PATS IDQE FANNTMKKVPR 

LGGHMr|hRPLPNS IAIASTSHELESSHE IKNTR 

LGGHMR|KRPPTITANTTNTKVTLSTTIDDTSNYTSESgHDYDEIKEKPR 

LGGHMRlHRAVIN- - DKN-LQAPDDQ HAP§VKKANGRR 

LGGHMRpHRAVMN ENN - LQVT P§VKKSNSRR 

LGGHMRjjHRAE AE AAAT TTTTTTKNGD VGKAAAVKACDG 

LGGHMRgHRA§ IDE KSK-™ - AATK- AM MI Pj'LKKSNS S K 

LGGHMRiHRiElSF^K TLK- - VARKTTT MI P§L KKSNS S K 

LGGHMR|HRNES GAAGG AL VTRALLPE P jfVTTLKKS SSGK 

L GGHMRp'KRNE S GAAGG AL VTRALLPE P jjJVTTL KKS SSGK 

L GGHMR|HRNE KVS - - G SLVTR SFLPETT jfVTAL KKFSSGK 

LGGHMRjfeRNEKVS--G SLVTRS FLPETT IfVTALKKFS SGK 

LGGHMR|HRNEKVS- -G SLVTRS FLPETT- - - -|VTALKKPSSGK 

;LGGHMR|HRS E KAS P-G T LVTRSFLPE TT ||VTTL KKS SSGK 

LGGHMRgH RS SMTV EPS-FISPMIP |MPlLKRCG S S K 

LGGHMRfHRTSMIT EQS - IVPSWY |RPiFHRCSSSK 

LGGH|§RCH§§ITSNS PDSSKFHFNGHVEQINLRSNMH 

LGGHMR§KPfKL KKKKQKNCDDKLGgGKS 

LGGKMR1hr|KLGH EL- KKQKQKNCDEIVEIEKKNNS GT 

LGGHMRiHRlELNQ LQQQKKKIKMDDEKSDVSEEWQEKKGN 

LGGHMRCHH KE EEPMISG 

LGGHMR ChJ|kE EEPMISG 

[ LGGHj KRRHYEG T V 
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A AKfl 1711 


(223) 




(223) 


A AM£i 71 


(16 2) 


TvT P T QQ1 71 


( 162 } 


AP7 Rfl^^ 


(18 6) 


JTi X U J T J O 


(18 6) 


r-i, 1 U D J U U D 


( 1 S 6 ) 


-rtJT. vJ *T U \J 


(18 6) 


KTM 1 0 9 S 1 R 


(18 6) 


X95 5 73 


(18 6) 


PA A£7 ? 9 ft 


(13 6) 


v-iirvu / Zi ^ j 


(18 6) 


V Q o £-7 fj 


(18 6) 


^ 9 g g7 ^ 


(18 6) 


RAG4 14^4 


( 199) 


ATP 1 9 £ 0 R4 


(19 9) 


RAA0S079 


(206) 




(225) 


RAAG ^077 


(223) 


R A AO R D 7 R 


(201) 




(18 6) 


MP 7 01 R4fi 


(202) 


IVfp IRQ q QO 


(219) 


R AA 1 9 97 6 


(530) 


RAA7 0 117 


(529) 




(220) 


RAG4 1 00 R 


(266) 


KFP 1 R7 01 7 


( 2 90 ) 


AATf7 6 0 91 


( 144 ) 


D AR 7 R 8 R 


(13 2) 


CAA6 7 2 3 3 


(248) 


CA A67 7 1 


(24 8 ) 


RAA1 91 1 1 


(299) 


BAA2192 0 


(24 0) 


P A A4 1 1 1 1 


(237) 


RAA1 91 1 0 


( 14 5 } 


RA A? 1971 


M 1 G ) 

\ -L ~J U / 


RARfi? R 7 9 


(195) 


R A A 7 1 991 


(14 0) 


R A A7 1 9? 7 


( 14 1 ) 


CAA67 2 1 1 


(13 7) 


fAAf^TP 1 2 


(13 7) 


CAA672 1 4 

■£™i.f"i. O ( *j J M 


(14 0) 


CAA& 77 3 S 


(14 0) 


PAR 9 0 916 


(14 0) 


PAR 9 0 91 ^ 


( 14 0 } 


nanfi 7 7in 


( 141) 




( 144 ) 


RAA1 9114 


(351) 


BAA2192 4 


(121) 


BAA2192 5 


(137) 


BAA2192 6 


(148) 


CAA18741 


(256) 


CAB80245 


(256) 


Consensus 


(651) 



651 700 

ATRA gftA|P|F VWRP C S KGK - --KMWD EEE;H0i p l AFKKPRXj 

ATRArDI^iAiP8FVWRPCSKGK---KMWD EEKSVQSPgAFKKPRL 

• HRG Fgil 1 1 flQGF SR DD^Mgp|iATKKPRL 

- HRG J DliH I PVC 11 'F S P DD^M|p|aTKKPRL 

-HRGFPLlTIPPIPgFSMVMG DD^MpP|PAKKPRF 

■ HR 2 h 'DLNcI P P.:XP|FS MVNG DDffiM|p|pAKKPRF 

- HRG . : DLNlPPl PfFSMVNG Dr:^K : SP|PAKKPRF 

- KRGFDLNXP PIPpFS MVNG Dd|§m1p|PAKKPRF 

■ HRO-Wm&mmFSmHO D D;B||m! j P jp AKK P R F 

- HRG r D Li: IP PI p|f SM VMG DD^lVlf P jjp AKKP RF 

- HRG fDLKXpPX PEF SM^TSTG DDS : gK3PMPAKKPRF 

- liRG I'DJ-jNIPPI PfiF SMVNG D D § P || P A, K K P R F 

- HRG F-BL^IPPXPgF SI-1 VJMG DDgHMpPHPAKKPRF 

■■ HRG I- DLXIX PF I J - EH S M \ MG DDEVMS PMPAKKPRF 

-HRGFDLUIP?IP|FSMVNG DEElMgPHPAKKLRF 

- HRG - VLB 1 P ? I P B F SMVMG D E Vr; g P^'PA K KLP F 

- HR 1 "■ - IIS I PALPS F P GFGSG E L'EVESPH PARKS RL 

TLRDFDLNMLPPSPELQLiGLSXDC- - GLKSQQVPIEQ=EVE^P^?LI<:KPRL 
TLRDFg|j|||;L|fPS PpLQLGLS IDC - -DLKS ~QIPIEQ;g|^BpP|jF.LtKKPR.L 
I R RD - DL^LRP - S p|lTLGMS VDC - - ERKS - Q),,S GKQliE|P||PTKK PRL 

FA DLNLPA GQVSDETI GKS Q L|GEETKS VL - 

SV1LVAG5DVS|VDNERWS EES A1GGHRGFDLNL 

HRGFiiiSil^ASPiLSLHHN P I VDEm^P JTGKKPLL 

IVDALDA^LPAPVg'VNGE PAFVSW 

ADD ALDANA PA P T DDEDNEK AKFVSW 

S GTMfSSilPAT L^D EISQ I^SfVSDD EFVNP 

KQRMXDLNLPAPT^DEVS V§FQ — - 

K Q RM B^Pl^ p ^'S DETS l|fdew 

iEPMP|LNYPP LED|GDGSEPELL 

| E PMP|LMY P P LED§GDGS E PELL 

KY LPLDLNL P A PG^LRE S K FQGIVFSATPALI 

K^P^^^APGHLRESK FQGIVFSATPALI 

DGLSj^^^VSDpNsLDPKYPWS LKQQDQEKTQRQQLV 

NFLS^^^PAPE^HRPETKFSFAS KlQVIVFSAS PLVD 

I ILSLD1 NlxPA^pjgDHHSDNTKFDFS g5xQCL|FSAAALV 

~ ILS ^^B^H EN |- LEFD LRKp^TAPMVDCF 

-VLC^SlTSENpTLEFK LGK- -AARIVDCL 

GGVCIiDLNLTPSENRAKCRNWG LGAGGQGpHKALAML 

RIFCLDLNLTPR tNf^VDLKL WPT§P|S S PVLRI 

R I F C||DLNL : i P F N|pVDLKL W PT§ P|S S PVLR I 

RVAC^pS - LGfVpN LNLKL ^GRT|Y 

RVAC^pS - LG§Vf NLNLKL ^GRt|y 

RVACii|,DjiDS|ESLVNWKL |gGRI^SWS 

RVAC LDL L ' LD Sj|E G L WW KL ^GRTfSWS 

RVAC^|D|DSiESLVNWKL ^GRTfSWS 

RVAC LB LH ^ -SpEStiVKWKL |||GRT§§S 

R1LS|J|J|J|T|8EN|-"LEYI FGKTFiPKIDMKF 

- - E 1. h DANA TPLEN| - LVL I FGKNlJfPQ IDLKF 

KSDATiDLNNLPl*HH|MSRIRRDPFNPLSFEVSTDIHLQYPWSCAPKNDDN 

--LFtDLNL' PYENgLMLGI IPLKVFDS 

GXLFFpXNLTPYEN|LMLGI IPLKVFI 

AGLF - DLNLCPDENpVRARM TSVHS 

- - -ALDL24VPP - 1 1 QlLSTSD TSGCCBdLRLGL • 

ALDLITOPPT I Q|LS TSD TS GCC|DLRLGL - 

[LDLNLPPj l ED EV S M 
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701 750 



AAK01713 


(267 ) 


LTA 


AF3 32 876 


(2 67) 


LTA 


AAM67193 


( 193 } 




^P_l 9 913 1 


( 193 } 




AF250336 


(22 0) 


DFPVKLQL 


AY034998 


(220) 




AY0 6 3 006 


(22 0) 


DFPVKLQL 


CAA64 82 0 


(220) 




NM__10253 8 


(220 ) 


DFPVKLQL 


X95573 


(220 ) 




CAA6 7 22 8 


(22 0) 


DFPVKLQL- 


CAA6 722 9 


(220) 


DFPVKLQL 


X93670 


{220} 


DFPVKLQL 


X98671 


(220) 


DFPVKLQL 


BAC4 3 454 


(233 ) 


DFPGKP 


NP__1 96054 


(233 } 




BAA0r>07 9 


(242 ) 


SLPPKLELFKGL 


BAA0 5 076 


(273 ) 


LFSMD 


BAA05077 


(270 ) 


LFSMD 


BAA0507S 


(247 ) 




NP_190562 


(216) 




NP_2 0154 6 


(236) 


PADQVSVTTS 


NP__18 8 592 


(257 ) 


LTDHDQVIKKEDLSLK1 


BAA19926 


(555 ) 




BAA2 013 7 


(555) 




BAA2 1919 


( 254 } 




BAC4 3 0 08 


(289) 




NP_182 037 


(315) 




AAL7 6 091 


(16 8) 


NLLV 


BAB 6 7 8 85 


(15 6) 


NLLV 


CAA6 723 3 


(2 81) 


DCHY 


CAA6 72 3 6 


(281) 


DCHY- 


BAA19111 


(33 8) 


DCHY 


BAA2 192 0 


(280) 


CHY 


CAA4 3111 


(278) 


DCHY 


BAA1 9110 


( 176 ) 


L 


BAA2 192 3 


(16 6) 


Xj 


BAB 67 8 7 9 


(233) 


DCFL 


BAA2 1921 


(17 3) 


pj 


BAA21922 


( 174 ) 





CAA6 7 231 


( 163 ) 




CAA6 7232 


( 163 ) 




CAA6 7234 


(169) 




CAA6 72 3 5 


(169) 




CAB90 93 6 


( 169 ) 




CAB 90 9 3 5 


( 165 ) 




CAA67230 


(173 ) 


VL 


CAB67667 


( 174 ) 


VIST 


BAA19114 


(401) 


DN Y YLEE I K I DSMANN G KYN I NlSf GATQNVEDDE AD S KLKLAKLSD L KDMN 


BAA21924 


(147) 




BAA21925 


(164) 




BAA21926 


(173) 




CAA18741 


(285) 




CAB80245 


(285) 




Consensus 


(701) 
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751 



774 



AAK.01713 


(27 0) 


AF3 3 2 876 


(27 0) 


AAM67193 


( 194 ) 


MP_19 9131 


(194 ) 


AF2 5 0 3 3 6 


(22 8) 


AY0 3 4 998 


(228) 


AY063 006 


(228) 


CAA64 82 0 


(22 8) 


NM_102 53 8 


(22 8) 


X95 573 


(228 ) 


CAA6 7 2 2 8 


(228 ) 


CAA6722 9 


(228 ) 


X98 670 


(228 ) 


X98 671 


(22 8) 


BAC43 4 54 


(2 39) 


NP__196 054 


(23 9) 


BAAO 5 0 7 9 


(2 54 ) 


BAA0 5 0 76 


(278 ) 


BAAO 5 0 77 


(275 ) 


BAAO 5 0 7 8 


(254 ) 


NP__190562 


(216) 


NP 201546 


(246 ) 


NP__18 8 5 92 


(274 ) 


BAA1 992 6 


(555 } 


BAA2 013 7 


(555 } 


BAA2 1919 


(254 ) 


BAC4 3 0 08 


(2 89) 


NP_182 037 


(315) 


AAL7 6 0 91 


(172 ) 


BAB 6 7 8 8 5 


(160 ) 


CAA672 3 3 


(2 85) 


CAA6 72 3 6 


(2 85) 


BAA19111 


(342 ) 


BAA2 192 0 


(2 83) 


v^ J ?V^\4 3111 


(2 82) 


BAA19110 


( 177 ) 


BAA2192 3 


(167 ) 


BAB67 87 9 


(237) 


BAA2192 1 


(175 ) 


BAA2 192 2 


(176 ) 


CAA67 2 3 1 


(163 ) 


CAA67 2 32 


( 163 } 


CAA67 2 34 


(169) 


CAA67235 


(169) 


CAB9 0 93 6 


(169) 


CAB 9 0 93 5 


( 165) 


CAA6 72 3 0 


( 175 ) 


CAB6 76 67 


{ 176} 


BAA19114 


(451) 


BAA2192 4 


(147 ) 


BAA21925 


(164) 


BAA2192 6 


(173) 


GAA18741 


(285) 


CAB80245 


(285) 


Consensus 


(751) 
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Annex 5 

AtSTZparalog2: SEQ ID NO: 28 under the control of the beta-expansme EXPB9 
(PRO061 2} 

A DNA fragment encoding a 2XC2H2 protein represented in the application as filed by SEQ ID 
NO: 29 was isolated from an Arabidopsis thaliana seedling cDNA library (Invitrogen, Paisley, 
UK) by PCR amplification and subsequent cloning in an entry clone vector according to the 
methods described in Example 1 of the present application, SEQ ID NO: 29 polypeptide was 
encoded by the longest open reading frame of SEQ ID NO: 28 (AtSTZparaiog2). 

The primers used for the PCR amplification were as follows: 
Forward primer: 

ggggacaagtttgtacaaaaaagcaggcttaaacaatggcacttgaaactcttact 
Reverse primer: 

ggg9 acc actttgtacaagaaagctgggtttcctaggtttatgtttaggg 

The entry clone was subsequently used in an LR reaction with a destination vector used for rice 
transformation to generate the plant expression vector pEXPB9::AtSTZparalog2, The PCR 
amplified DNA fragmented was operabSy linked to the promoter PR0061_2 (pEXPB9: beta- 
expansine EXPB9), which is described in Table 10 in Table 10 on page 48 of the application as 
filed, 

Phenotypic characterization of the transformed plants was carried out essentially as described in 
Example 3 of the present application. The results are shown in Table I beiow. 

Table I: Results of phenotypic characterization of T1 rice plants transformed with 
pEXPB9::AtSTZparaiog2. 



P EXPB9::AtSTZparalog2 


Parameter 


% increase in the transgenic plants 
compared to the nuliyzygous plants 


Number of filled seeds 


6.2 


Seed filing rate 


3.4 
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Harvest index 


3.9 


Number of panicles 


3.8 



The above results show that overexpression of the nucleic acid represented by SEQ ID NO: 28 
(encoding the 2XC2H2 zinc finger protein represented by SEQ ID NO 29) under the control of a 
promoter expressed in young expanding tissues (pEXP9) gives an increase in plant yield a 
shown by the increased in the number of filled seeds, the increase in the seed filing rate 
(proportion of filled seed with respect to the total number of seeds harvested per plant, increased 
in the harvest index and the increase in the number of panicles per plant. 
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Annex 6 

SEQ ID NO: 7 XDLNXXP 



Table 1. Percentage identity of the EAR motif as found in the in the amino acid of the sequence 
listing of the application as fiied and SEQ ID NO 7. 





Motif 


% Identity to SEQ ID 
NO: 7 (XDLNXXP) 


% identity to SEQ ID 
NO: 7 XDLNXP 


Q CT (~\\ r\h\(~\ 1 1 r^-a+io^a n lfM-nes.ro to 

ocUIUinU^I I _UaiiSCa_giGmeraIa 


phi mipa 


00 


inn 
IUU 


SEQIDNO J 3 JBIycinejnax 


FDLNIPA 


85 


100 


SEQIDNQJ 5„Medicago^sativa 


FDLNLPA 


85 


100 


SEQIDNO„17_Nicofiana_tabacum 


FDLNMPA 


85 


100 


SEQIDN0J9„0ryza„sativa 


FDLNLPA 


85 


100 


SEQ] DNO JJ7 jOryzajsativa 


FDLNLPA 


85 


100 


SEQ!DNO„23_Triticum__aestivum 


FDLNIPA 


85 


100 


SEQIDN0^27J\rabidopsisJhaliana 


IDLNLPA 


85 


100 


SEQlDNOJ39J\rabidopsisJhaliana 


LDLNLTP 


100 


83,3 


SEQIDNO^50j\rabidopsisJhaliaria 


LDLNLPA 


85 


100 


SEQIDNCL21 ^Petun ia Jiy brid a 


FDLNIPA 


85 


100 


SEQIDN0^25^Capsicum_annuum 


FDLNIPA 


85 


100 


3EQIDNO„29J\rabldopsisJhaIiana 


FDLNIPP 


85 


100 


SEQIDNOJ33J\rabidopsisJhaliana 


FDLNliP 


100 


83,3 



Table 2. Percentage identity of the EAR motif as found in the polypeptides recited by their 
sequence database accession number in lines 17 to 30 of page 15 and the paragraph bridging 
page 15 and 16 of the specification and SEQ ID NO 7. 





Motif 


% Identity to SEQ ID 
NO: 7 (XDLNXXP) 


% Identity to SEQ ID 
NO: 7 XDLNXP 


AAK01713 


FDLNLPA V 


85 


100 


AF332876 


FDLNLPAV 


85 


100 


AAM67193 


FDLNILPV 


100 


83,3 


NPJ99131 


FDLNISPV 


100 


83,3 


AF250336 


FDLNIPPI 


100 


100 


AY034998 


FDLNIPPI 


100 


100 


AY063006 


FDLNIPPI 


100 


100 


CAA64820 


FDLNIPPI 


100 


100 


NM_1 02538 


FDLNIPPI 


100 


100 


X95573 


FDLNIPPI 


100 


100 


CAA67228 


FDLNIPPI 


100 


100 


CAA67229 


FDLNIPPI 


100 


100 


X98670 


FDLNIPPI 


100 


100 


X98671 


FDLNIPPI 


100 


100 


BAC43454 


FDLNIPPI 


100 


100 


NP_1 96054 


FDLNIPPI 


100 


100 


BAA05079 


FDLNIPAL 


85 


100 


BAA05076 


FDLNMLPP 


100 


83,3 
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RA A0R077 


pni nmi pp 


I UU 


RR R 


RA A0R07R 


PHI Ml PP 


100 

I UU 


inn 

I UU 


MP 1Q0RR9 


PHI Ml PAH 

ruLIMLrnU 


RR 

OU 


inn 

I UU 


NP 1RRRQ9 


IHI Ni PA! 


RR 

OU 


inn 

I UU 


RAA1QQ9R 


I HI NI PAP 


RR 

OU 


inn 

I UU 


RAA901R7 


I ni ni pap 


RR 

OU 


100 

I UU 


RAA91Q1Q 


mi MIPAT 


RR 

OU 


100 

I UU 


rap:4roor 


ID1 NI PAP 


RR 

UU 


100 

l UU 


MP 1R90R7 


\D\ Ml PAP 


RR 

OU 


100 

l UU 


CAA67233 


LDLNLPAP 

l_ LJ L.I M L-l 1 


85 




PAAR79RR 


1 HI N! PAP 


RR 

OU 


100 

i UU 


RAA1Q1 1 1 

DAM I v3 I I I 


1 D! Ml PV^ 


RR 

OU 


100 

1 UU 


RAA91Q20 


1 Dl Ml PAP 


RR 

OU 


100 

i UU 


PAAAR1 1 1 


1 ni Ml PAP 
LULlMLrnr 


RR 
Ou 


100 

t UU 


RAA1Q1 10 
Dnn I c? I i U 


i ni mi tpi 


100 

I UU 


RR R 
00,0 




LDLNLTPL 


mo 

I UU 


RR R 
OOjO 


RARR7R7Q 


I ni ni tp^ 


inn 

I UU 


RR R 

O JjO 


RAA91Q91 


I ni Ml TPR 


ino 

I UU 


RR R 


RAA91Q99 


i ni mi tpp 


inn 

lUU 


RR R 
00,0 


CAA67230 


LDLNLTPL 


100 


RR R 


CAB67667 


LDLNLTPL 


100 


83,3 


BAA19114 


LDLNNLPT 


100 


83,3 


BAA21924 


LDLNLTPY 


100 


83,3 


BAA21925 


FDLNLTPY 


100 


83,3 


BAA21926 


FDLNLTPD 


100 


83,3 


CAM 8741 


LDLNVPPT 


100 


100 


CAB80245 


LDLNVPPT 


100 


100 



SEQ ID NO: 8 KRSKRXR 

Table 3. Percentage identity of the B-box motif as found in the in the amino acid of the sequence 
listing of the application as filed and SEQ ID NO 8. 





Motif 


%ldentitytoSEQID 
NO: 8 (KRSKRXR) 


IP_070_1 1 _Datisca_glomerata 


RKRSKRTR 


100 


IP_070_13_Giycine_max 


RKRSKRSR 


100 


IP_070_15_Medicago_sativa 


GKRSKRSR 


100 


I P J)70„ 1 7_H icotian a Jabacum 


GKRSKRPR 


100 


IPJ370J9J3ryzajsativa 


RKRSRRGR 


85 


I P_070_37_Oryza_sativa 


RKRSRRGR 


85 


IP_070_23„Triticum„aestivum 


RKRSRRQR 


85 


IP_070„27_ArabidopsisJha!iana 


RKRSKRQR 


100 


IP_070_31_Arabidopsis_thaliana 


RKRTKRHR 


85 


IP_070_35_Arabidopsis_thaliana 


RKRTKRQR 


85 


I P J)70„2 1 „Petu nia_hy brida 


GKRSKRQR 


100 


IP_070„25„Capsicum_annuum 


GKRSKRSR 


100 


IP_070_29_Arabidopsis_thaIiana 


SKGKRSKRSR 


100 
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lPJ)7Q^ 33J\rabidopsisJhaliana \ CKKRSKRSR 



100 



Table 4„ Percentage identity of the B-box motif as found In the polypeptides recited by their 
sequence database accession number in lines 17 to 30 of page 15 and the paragraph bridging 
page 15 and 16 of the specification and SEQ ID NO 8. 





Motif 


% Identity to SEQ ID 
NO; 8 fKRSKRXR) 


AAK01713 


KRSRRQR 


85 


AF332876 


KRSRRQR 


85 


AAM67193 


KKRSKRSR 


100 


NP 199131 


KKRSKRSR 


100 


AF250336 

1 \Z I— \J \J V/ \J 


KRSKRSR 


100 


AY034998 


KRSKRSR 


100 


AY063006 


KRSKRSR 


100 


CAA64820 


KRSKRSR 


100 


NM J 02538 


KRSKRSR 


100 


X95573 


KRSKRSR 


100 


CAA67228 


KRSKRSR 


100 


CAA67229 


KRSKRSR 


100 


X98670 


KRSKRSR 


100 


X98671 


KRSKRSR 


100 


BAC43454 


KRSKRSR 


100 


NP_1 96054 


KRSKRSR 


100 


, BAA05079 


KRSKRQR 


100 


BAA05076 


KRSKRPR 


100 


BAA05077 


KRSKRPR 


100 


BAA05078 


KRSKRPR 


100 


NP 190562 


KRTKRHR 


85 


r NP 201546 


KRTKRQR 


85 



SEQ iP NO: 9 EXEXXAXCLXXL 

Table 5. Percentage identity of the L-box motif as found in the in the amino acid of the sequence 
listing of the application as filed and SEQ ID NO 9. 





Motif 


% Identity to SEQ ID 
NO: 9 (EXEXXAXCLXXL) 


I P J37QJ 1 _Datisca„glomerata 


EEEYLAFCLIML 


100 


IP_070_1 3_Glycine_max 


EEEYLALCLIML 


100 


IP_070_1 5_Medicago_sativa 


EEEYLALCLIML 


100 


IP_070_17_Nicotiana_tabacum 


EEEYLALCLIML 


100 


I P_070_1 9_Oryza_sativa 


EEENLALCLLML 


100 


I PJ37Q_37„Oryza_sativa 


EEENLALCLLML 


100 


I P„070„23 JTriticuirLaestivum 


EEENLALCLLML 


100 


I P_070_27_Arabidopsis Jhaliana 


EEEYLALCLLML 


100 


I P_070_3 1 _Arabidopsis_thaliana 


EEEYLALCLLML 


100 


I P_07Q_35_Arabidopsis_thaliana 


EEEYLALCLLML 


100 
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I P J70_50_Arabidopsis_thaliana 


EEEDMAICLIML 


100 


I P_070_46_Arabidopsis_thaliana 


VEDTAAKCLMLL 


83,3 


I P_070_21 _Petunia_hybrida 


EEEYLALCUML 


100 


I P_070_25_Capsicum_annuum 


EEEYLALCLIML 


100 


I P_070_29_Arabidopsis Jhaliana 


EDEYIALCLMLL 


100 


IP_070_33_Arabidopsis_thaliana 


EEEYLAFCLMLL 


100 



Table 6- Percentage identity of the L-box motif as found in the polypeptides recited by their 
sequence database accession number in lines 17 to 30 of page 15 and the paragraph bridging 
page 15 and 16 of the specification and SEQ ID NO 9. 





Motif 


% Identity to SEQ ID 
NO; 9 (EXEXXAXCLXXL) 


AAK01713 


EEENLALCLLML 


100 


AF332876 


EEENLALCLLML 


100 


AAM67193 


EEEYLAFCLMLL 


100 


NPJ99131 


EEEYLAFCLMLL 


100 


AF250336 


EEEYLAFCLMLL 


100 


AY034998 


EEEYLAFCLMLL 


100 


AY063006 


EEEYLAFCLMLL 


100 


CAA64820 


EEEYLAFCLMLL 


100 


NMJ02538 


EEEYLAFCLMLL 


100 


X95573 


EEEYLAFCLMLL 


100 


CAA67228 


EEEYLAFWLMLL 


100 


CAA67229 


EEEYLAFWLMLL 


100 


X98670 


EEEYLAFWLMLL 


100 


X98671 


EEEYLAFWLMLL 


100 


fBAC43454 


EDEYIALCLMLL 


100 


NPJ 96054 


EDEYIALCLMLL 


100 


BAA05079 


EEEYLALCLIML 


100 


^JP 190562 


EEEYLALCLLML 


100 


NP_201546 


EEEYLALCLLML 


100 


NPJ88592 


EEEYLALCLLML 


100 


BAA19111 


DEEETARYLILM 


75 


BAA21920 


EEEDMANCLILL 


100 


CAA43111 


EDEDMANCLILL 


100 


BAA21921 


EAAAMANCLMLL 


91,6 


BAA21922 


EAEAMANCLMLL 


100 


CAA67231 


VDVTAANCLMLL 


83,3 


CAA67232 


VDVTAANCLMLL 


83,3 


CAA67234 


EEDTTAKCLMLL 


91,6 


CAA67235 


EEDTTAKCLMLL 


91,6 


CAB90936 


EEDTTAKCLMLL 


91,6 


CAB90935 


VEDTAAKCLMLL 


83,3 


BAA21925 


ETLAMANCVNIL 


83,3 


BAA21926 


ERLAMENCANIL 


83,3 
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